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Abstract: Let Xi ,... ,Xn be independent nonnegative random variables 
(r.v.’s), with Sn := Xi + Xn and finite values of Si := E Xf and mi := 
EXi > 0 . Exact upper bounds on E/(S'ti) for all functions f in & certain 
class ^ of nonincreasing functions are obtained, in each of the following 
settings: (i) n, mi,..., mn, si,..., are fixed; (ii) n, m := mi -h • • • -|- mn, 
and s := Si H- • • • -|- Sn are fixed; (hi) only m and s are fixed. These upper 
bounds are of the form E /(r;) for a certain r.v. 77 . The r.v. 77 and the class ^ 
depend on the choice of one of the three settings. In particular, {ra/s)rj has 
the binomial distribution with parameters n and p := m^/(ns) in setting 
(ii) and the Poisson distribution with parameter A := /s in setting (hi). 
One can also let p have the normal distribution with mean m and variance 
s in any of these three settings. In each of the settings, the class ^ contains, 
and is much wider than, the class of all decreasing exponential functions. 
As corollaries of these results, optimal in a certain sense upper bounds on 
the left-tail probabilities P{Sn ^ x) are presented, for any real x. In fact, 
more general settings than the ones described above are considered. Exact 
upper bounds on the exponential moments Eex.p{hSn} for h < 0 , as well 
as the corresponding exponential bounds on the left-tail probabilities, were 
previously obtained by Pinelis and Utev. It is shown that the new bounds 
on the tails are substantially better. 
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60G48. 
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References 


1. Introduction 


Let Xi,..., X„ be independent real-valued random variables (r.v.’s), with 

Sn ■= Xi Xn- 


Exponential upper bounds for S'„ go back at least to Bernstein. As the starting 
point here, one uses the multiplicative property of the exponential function 
together with the condition of independence of Xi ,..., to write 

n 

Ee'*®" (1.1) 

1 


for all real h. Then one bounds up each factor E , thus obtaining an upper 
bound (say Mn{h)) on Ee^'^", uses the Markov inequality to write P(5'„ ^ cc) < 
g-hx ^ Bn{h,x) := Mn{h) for all real x and all nonnegative real h, 

and finally tries to minimize Bn{h, x) in h ^ 0 to obtain an upper bound on the 
tail probability P(S'n ^ a;)- 

This approach was used and further developed in a large number of papers, 
including notably the well-known work by Bennett [T] and Hoeffding [T3] . Pinelis 
and Utev [55] offered a general approach to obtaining exact bounds on the 
exponential moments Ee^'^", with a number of particular applications. 

Exponential bounds were obtained in more general settings as well, where the 
r.v.’s Ai,... ,Xn do not have to be independent or real-valued. It was already 
mentioned by Hoeffding at the end of Section 2 in [T4| that his results remain 
valid for martingales. Exponential inequalities with optimality properties for 
vector-valued Ai,..., A„ were obtained e.g. in [571115] and then used in a large 
number of papers. 

Related to this is work on Rosenthal-type and von Bahr-Esseen-type bounds, 
that is, bounds on absolute power moments E|S'„|p of Sn] see e.g. [46l liTl [26l 


[I51[3[I3[11I11[51|51|51]- 

However, the classes of exponential functions e^' and absolute power functions 
I • are too narrow in that the resulting bounds on the tails are not as good 
as one could get in certain settings. It is therefore natural to try to consider 
wider classes of moment functions and then try to choose the best moment 
function in such a wider class to obtain a better bound on the tail probability. 
This approach was used and developed in [ini [IS dsi nil HI [55], in particular. 
The main difficulty one needs to overcome working with such, not necessarily 
exponential, moment functions is the lack of multiplicative property (1.1). 

In some settings, the bounds can be improved if it is known that the r.v.’s 
Ai,..., Aji are nonnegative; see e.g. [m 0 dsnu. However, in such settings 
the focus has usually been on bounds for the right tail of the distribution of 5'„. 
There has been comparatively little work done concerning the left tail of the 
distribution of the sum Sn of nonnegative r.v.’s Ai,..., A„. 
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One such result was obtained in [3S]. Suppose indeed that the independent 
r.v.’s Xi,..., Xn are nonnegative. Also, suppose here that 


ui I — E Xi -t- • • • -p E X^ > 0 and s — E X‘^ -t- • • • H- E X‘^ oc . 

Then [351 Theorem 7] for any x G (0, m] 

{x — mY 


( 1 . 2 ) 


P(S'„ < a:) ^ exp I - — [l + — In —| < exp | 
L s V m emJ) I 


2s 


} (1-3) 


(in fact, these inequalities were stated in |38j in the equivalent form for the 
non-positive r.v.’s —Xi ,..., —Xn)- These upper bounds on the tail probability 
P(S'„ ^ x) were based on exact upper bounds on the exponential moments of 
the sum Sm which can be written as follows: 


Eexp{/iS'„} Eexp < Eexp {h(m -h Z^/s)] (1.4) 


for all real h < 0. Here and subsequently, for any A G (0,oo), let H;), and Z 
stand for any r.v. having the Poisson distribution with parameter A G (0,oo) 


and for any standard normal r.v., respectively. The bounds in (1.31 and (1.41 


have certain optimality properties, and they are very simple in form. Yet, they 


have apparently been little known; in particular, the last bound in (1.3) was 
rediscovered in |3T] . 

In the present paper, the “Poisson” and “normal” bounds in (1.4) will be 


extended to a class of moment functions much wider than the “exponential” 
class (still with the preservation of the optimality property, for each moment 


function in the wider class). Consequently, the bounds in (1.3) will be much 
improved. We shall also provide “binomial” upper bounds on the moments and 
tail probabilities of Sm which are further improvements of the corresponding 
“Poisson”, and hence “normal”, bounds. 


2. Summary and discussion 


Let Xi,..., Xn be nonnegative real-valued r.v.’s. In general, we shall no longer 
assume that Yi,...,A„ are independent; instead, a more general condition, 
described in the definition below, will be assumed. Moreover, the condition 
(1.2) will be replaced by a more general one. 


Definition 2.1. Given any m = (mi,..., m„) and s = (si,..., s„) in [0, oo)", 
let us say that the r.v.’s Xi,..., Xn satisfy the (m, s)-condition if, for some filter 
{s^Q ,..., s^n-i) of sigma-algebras and each i G 1, n, the r.v. Xi is ^-measurable. 


E(Yj|M-i) ^ m*, and E(Af |^/i_i) < s^. (2.1) 


Given any nonnegative m and s, let us also say that the (m, s)-condition is 
satisfied if the (m, s)-condition holds for some m = (mi,...,m„) and s = 
(si,..., Sn) in [0, oo)” such that 

mi -I- • • • -I- m„ ^ m and si s„ ^ s. (2.2) 
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In the above definition and in what follows, for any a and /3 in Z U {oo}, we 
let a,/3 := {j e Z: a ^ j ^ /?}. 

The following comments are in order. 

• Any independent r.v.’s Xi,..., satisfy the (m, s)-condition if E ^ 
rrii and EAf ^ Si for each i G l,n; if at that 
condition holds as well. 

• If r.v.’s Xi ,..., Xn satisfy the (m, s)-condition, then the r.v.’s Ai—mi,..., 
A„ — m„ are submartingale-differences, with respect to the corresponding 
filter {p/o,... 

• If, for some m and s in [0, oo)”, the (m, s)-condition is satisfied by some 
r.v.’s Ai,..., Xn, then necessarily 

Si ^ for all i G l,n. (2.3) 


2.2) holds, then the (m, s)- 


Moreover, if, for some nonnegative m and s, the (m, s)-condition is satis¬ 
fied by some r.v.’s Ai,..., A„, then necessarily 


n ^ equivalently, (2.4) 

Definition 2.2. Given any real numbers m and s such that m > 0 and s ^ m^ 
(cf. (2.3)), let stand for any r.v. such that 


^ ym,s _ 


E(r-’«)2 = s, and P(r-.«e{0,A}) = l; 


such a r.v. A”*’* exists, and its distribution is uniquely determined: 

P(y^m,s _ = 0) = ■ 

moreover, let A™’'*,..., denote independent copies of a r.v. A*”’®. Also, 
given any m and s in (0, oo)” such that the condition (2.3) holds, we shall 
always assume the corresponding r.v.’s A"*!’®!,..., A™"’®" to be independent. 


Next, let us describe the pertinent classes of generalized moment functions. 
For any natural j, let denote the class of all (j — I)-times differentiable 
functions g: K —>■ M such that the (j — l)th derivative of g has a right- 

continuous right derivative, which will be denoted here simply by g^^\ As usual, 
we let g^^^ := g. Take then any natural 

k^j + 1 

and introduce the class of functions 

'■= {g € is nondecreasing for each i G k — l,j} (2.5) 

and, finally, the “reflected” class 

■■= {ff- : 5 e ^+'}, 


(2.6) 
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where g {x) := g{—x) for all a: G M. It is clear that the class gets narrower 
as j increases (with a fixed k), and it gets wider as k increases (with a hxed j). 

As an example, the function x <—>■ a + bx + ce~^^ belongs to for any 

oGM, 6^0, c^O, A^O (and any natural k and j such that k ^ j + 1). Also, 
given any oGM, 6^0, c^O, and w G M, the function xi-^a + bx + c{w — a;)“ 
belongs to for any real a ^ k (and any natural k and j such that k ^ j + 1); 
here and elsewhere, as usual, x+ := max(0,x) and a;“ := (a;+)“ for a; G K. Note 
also that the classes are convex cones; that is, any linear combination 

with nonnegative coefficients of functions belonging to any one of these classes 
belongs to the same class. 

Remark 2.3. It is not difficult to see that, if a function / is in the class 
then the shifted and/or rescaled function x i—>■ f{bx + a) is also in the same 
class, for any constants a G M and & ^ 0. That is, these classes of functions are 
shift- and scale-invariant. 


Now we are ready to state the main result of this paper. 

Theorem 2.4. 

(I) Let Xi,..., Xn be any nonnegative r.v.’s satisfying the {ui,s)-condition 
for some m and s in (0,oo)”, so that (2.31 holds. Then 


EfiSr^) + + 


(2.7) 


for all f G 

(II) Let Ai,...,A„ be any nonnegative r.v.’s satisfying the (m, s)-condition 
for some m and s in (0,oo), so that (|2.4[) holds. Then 


( 2 . 8 ) 

(2.9) 

( 2 . 10 ) 


E/(5„)^ £/(!/"’"++ 

^ E/(m + Z^/s) 

for all f G in fact, ( |2.10| ) and the inequality 

Ef{Sn) ^ Ef{m + Z^s) 

both hold for all f G 

The necessary proofs will be given in Section 
Remark 2.5. Under the corresponding conditions given in Theorem |2.4[ the 


( 2 . 11 ) 


expected values in inequalities (2.7|-(2.11) exist (in K or, at least, in (— 00 , 00 ]), 
according to [35l Proposition 5.2, part (i)]. Moreover, the conditions for (2.7)- 
(2.11) in Theorem 2.4 can be supplemented or relaxed as follows. To describe 
these extended or relaxed conditions for (|2.7|)-(|2.11|), introduce the conditions 
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of equalities in (2.1) and/or (2.2): 


E{Xi\£/i-i) = rrii for all z, 
EiXf\£/i-i) = St for all z, 
mi + • • • + m„ = m, 

Si + ■ • • + Sn = s 


( 2 . 12 ) 

(2.13) 

(2.14) 

(2.15) 


and also conditions 


the Xi’s are bounded or / ^ p for some quadratic polynomial p, (2.16) 

EXf < oo for all i. (2-17) 


Then 


(I) inequalities (2.71 and (2.11) hold if any one of the following two conditions 
holds: 


(i) (2.12) and / G 


(ii) (2.12), (2.13), (2.16), and / G . 


(II) inequality (2.8) holds if any one of the following three conditions holds: 


(i) (j2T2]) and / G ^2:3. 

(ii) ( [2T^ , ( pT^ , ( [2T6l ), and / G 

(hi) ([2T^, (pT^, pTTl), and / G . 


(Ill) inequality (2.9) holds if any one of the following three conditions holds: 
(i) / G ^2:3. 


(ii) plel and / G 

(iii) pTTl ) and / G . 

(IV) inequality (2.10) holds if any one of the following two conditions holds: 
(i) / G ^2:2. 


(ii) (2.16) and / G 


This remark can be verified similarly to Theorem 2.4 

Obviously, the r.v.’s (2.7) satisfy the (m, s)-condition. 

So, inequality (2.7) is exact, in the sense that, given any natural n and any m 
and s in (0,oo)" such that (2.3) holds, the right-hand side of (2.7) is the exact 
upper bound on its left-hand side. Similarly, given any natural n and any m and 
s in (0, oo) such that (2.4) holds, inequality (2.8) is exact. 

Proposition 2.6. Given any m and s in (0,oo), the Poisson upper bound 
in (|2.9[) on E/(S'„) is exact [in this case n is not fixed, having only to satisfy 
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Inequality (2.11) is best possible in the following limited sense, at least. By 
[551 Corollary 5.9], this inequality holds for all / G if and only if it holds 
for all functions / of the form /u ,,2 for re G K, where 


fw,aix) := (w - x) 


+ ■ 


(2.18) 


Let now positive m and s vary so that vr? js —> oo, which is the case e.g. when 
0 7 ^ mi = m 2 = • • •, 0 < Si = S 2 = • • •, conditions (2.14) and (2.15) hold, 
and n —>■ oo. At that, hx any real k and let w = m + Let Lm,s-,w '■= 
E/tu, 2 (^ 11 ^ 2 / 5 ), which is, according to Proposition 2. 6 l the exact upper bound 
on ^fw, 2 [Sn) given m and s. Then Lm,s-,w ~ ^ fw, 2 Jjn + as usual, a ^ 

b means that a/b —?► 1. Indeed, introducing Z := (n„ 2 /g — nr?js')!^JvnZj 
one has Z —>• Z in distribution, so that \ Lm,s-w = E/„_ 2 (Z) —>■ E/k, 2 (Z) = 
^ ^fw, 2 {jn + Z^/s). This convergence is justihed, since /k, 2 (^) is uniformly 
integrable (as e.g. in Theorem 5.4]), which in turn follows because for any A 
and a in ( 0 ,oo) one has Eexp = exp |A(e“^/'^— 1 + 1/-\/A)} ^ < 00 

and fK,a{x)le~^ —)■ 0 as a; —>■ — 00 . 

Let ry denote an arbitrary real-valued r.v. Recalling that for any natural a 
and any ru G K the function f^i^a belongs to and applying the Markov 

inequality, one sees that Theorem |2.4| immediately implies 


Corollary 2.7. Let Xi ,..., X„ be any nonnegative r.v. ’s satisfying the (m, s)- 
condition for some m and s in (0,oo), so that ( |2.4[ ) holds. Then 

(2.19) 

( 2 . 20 ) 
( 2 . 21 ) 


P(S'„ ^ a;) < P3{Y.rL-m,s]x) 

^ T 3 (^cxD;m,s: 

^ P3(m -E Za/s; a;); 


here and in what follows, x is an arbitrary real number (unless otherwise indi¬ 
cated), 


and 


Zin;m,s ■— “E ’ ’ ’ “E Ln’ 


for natural n, 


Aloo;m,s ■— 


Pa{r]-,x):= inf 


E(r(; - rjY 


we(x,oc,) (w — x)'= 


( 2 . 22 ) 

(2.23) 


for any real a > 0. Also, the upper bound P^{m -E Zy/s] x) on P{Sn ^ x) can be 
somewhat improved: 


P{Sn ^ x) ^ P 2 {m-\-Zy/s-,x). (2.24) 

The computation of Pa{r];x) is described (in a somewhat more general set¬ 
ting) in |27L Theorem 2.5]; for normal rj, similar considerations were given al¬ 
ready in page 363] (those descriptions are given for the right tail of rj, so 
that one will have to make the reflection x i—>■ —x to apply those results). An 









Iosif Pinelis/ 


elaboration of nzi Theorem 2.5] is presented in [301 Proposition 3.2], Concerning 
fast and effective calculations of the positive-part moments E , see |3T]. In |3] , 
one can find specific details on the calculation of Pa{'q-,x) for a S {1,2,3} and 
rj with a distribution belonging to a common particular family such as binomial 
and Poisson. 

Let us present here some of those results, which will be useful in this context. 
Take any real a > 1 and any r.v. 77 such that E? 7 “ < 00 ; then there exists 
E 77 G (— 00 , 00 ]. Let 

Xt ■= x^.{t]) := infsupp(? 7 ), (2.25) 

where supp( 77 ) denotes the support set of (the distribution of) the r.v. 77 , and 


i{w) ■■= limw) 


E 77 ( 777 - 77 ))} ^ 
E(777 - 77)r' 


for w G (a;*, 00 ). Then, by |30l Proposition 3.2], the function 7 is continuous and 
nondecreasing on the interval (x*, 00 ) and for every x G (x*, E 77 ) there exists a 
unique Wx = Wx-a ,tj & (x*,oo) such that 


7 ( 7773 ,) = x; 


in fact, Wx G (x,oo). It follows that, for every x G (x*, E 77 ), 

{ < 0 for 777 G (x*, Wx), 

= 0 for 777 = Wx, (2.26) 

> 0 for 777 G (7773,500); 

in particular, Wx is the only root in (x*,oo) of the equation 



S'a 

0 

II 




Also by [30l Proposition 

3.2], 






P (77 < x) 

= P (77 = x) 

for 

X G 

(—00, X* 

Paimx) = < 

E“(777, - 
E““^(7773: 

ri)r^ 

- 77 )“ 

for 

X G 

(x*, E 77 ) 


1 

V. 


for 

X G 

[E 77,00). 


In particular, the upper bound Pa{ri;x) on the left-tail probability P (77 ^ x) is 
exact for x G (—oo,x*]. 

Thus, to evaluate Pa(ri\x) for any real x, it is enough to find Wx (that is, to 
solve equation (2.27)) for any x G (x*, E 77 ). 

This is especially easy to do if the r.v. 77 takes values in a lattice, which 
is the case when 77 is or Soo;m,s) as in Corollary 2.7 Again by [301 

Proposition 3.2], 

Pa{a + br]-,x) = Pa{r]-, ^) 
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for all real x and a and all b G (0, oo). So, the calculation of Pa{r]; x) for 77 equal 
'P‘n-,m,s or Soo;m,s reduces to the situation when the r.v. 77 is integer-valued with 
Xt = x^{r]) = 0; assume for now that this is the case. In view of (2.19) and 
(2.20), assume also that a = 3. Then, by (2.26), 


^3-,x{w) := UjW^ — 2bjW + Cj, 
where x G (a:*, £ 77 ) = (0, E 77 ), tc G (a;,,oo) = (0,oo), 


(2.28) 


aj 


Cj 


= [tc — 1 ] (so that j G 0 ,00 and j < w ^ j + 1), 
= ayx := E(77- a:)I{?7 < j} , 

:= E 77 ( 77 -a;)I{77^ j}, 

:= E?7^(77-a7) I {?7 < j} . 


— 


= Ci 


Therefore and in view of (2.27) and (2.26), for each x G (x*, E 77 ) = (0, E 77 ) one 
finds Wx as the only root in the interval {jx,jx + 1 ] of the quadratic equation 


26-j' ~\~ Cj — 0, 


(2.29) 


where jx '■= min {j G 0, 00 : a,- (j -I-1)^ — 2bj {j + l) + Cj ^ 0|. If a,^ ^ 0 then, by 
([ 2 ( 2 ^ and Wx is the greater of the roots of the above quadratic equation. 

The interesting paper [5] presents, for any given tt, G 0,oo U { 00 } and A G 
( 1 , 00 ), the exact upper bound (say S^.a) on P(5' ^ 1) under the condition that 
S = YIi=i where the X^’s are independent r.v.’s such that 0 ^ ^ 1 for 

all 7 G 1,77 and ES' = A. (For A G [0,1], the exact upper bound is trivial 
and equals 1; indeed, let Xi take values 0 and 1 with probabilities 1 — A and A, 
respectively, and let X^ = 0 for all i G 2, 77 .) Note that the conditions 0 ^ X^ ^ 1 
for all i and E ^ = A imply ^ - E X^ = A and ^^ E Xf < A, which corresponds to 
the ( 777 , s)-condition with m = s = X. So, it makes sense to compare the bound 
P 3 (S„;A,a; 1 ) in (2.19)-(2.20) with i3„,A- Graphs of these two bounds and their 
ratio in the case 77 = 00 are shown in Figure 


bounds 



ratio 



0.4 

0.2 - 


A 

0 20 40 60 80 100 


Fig 1. Left panel: graphs {(A, P3(Soo;A,a; 1)) • 1*1 ^ A ^ 8} (solid) and 
1.1 ^ A ^ 8} (dotted). Right panel: graph {(A, Bco,a/^3(^oo;A,a 5 1)) ■ 1-1 ^ ^ lOO}. 


The calculations of F 3 (Soo;A,a; l) here were done in accordance with the above 
description, containing formulas (|2.25)-(2.29); it takes less than 0.3 sec with 
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Mathematica on a standard laptop to produce either of the two graphs in Fig¬ 
ure It can be seen that the bound P 3 (T,ao-,\,\'A) is not much greater than 
the optimal bound Bao,x, especially when A is close to either 1 or oo; the corre¬ 
sponding comparisons for finite n look similar. On the other hand, our bounds 
Pd.{^n-,m,s', x) hold under much more general conditions: (i) for all a; G K, rather 
than just for a: = 1; (ii) assuming only the (m, s)-condition (on the sums of 
the first and second moments of the Xi's), rather than requiring all the Xi's to 
be bounded by the constant 1 - which latter also coincides with the value of x 
chosen in (iii) assuming the more general dependence conditions. 

By [30l Proposition 3.5], 

Paiv.x) t PooiiTjx) := inf (2.30) 


as a increases from 0 to oo; thus, the bounds Pa(rj]x) improve on the so-called 
exponential bounds Poo{ri;x). In particular, letting 


X — m 

A := — and z := —, 

s Vs 


one has (cf. ( |2.I9 ), ( |2.20| ), and (2.24)), 


P 2 {m + Z^/s] x) 
P 3 (Aloo;m,s ^ x'^ 


P3 (Ztri;m,s ; x'j 


^ Poo(m + ZyG;x) = e ^ 

^ VxD {Aloo;m,s ; 


= exp < - 


{ Va)^''(^^ Va) TaI. 


^ P^(m-\- Zy/s;x), 

^ -foo 5 

^ \ A-|- 2 \/A 


V A “h Z‘\f\ / 

^ -foo (^C5o;m,s ; 5 


n — X 


, — X — z\fX 


'i—X—zy/X 


(2.31) 

(2.32) 

(2.33) 

(2.34) 

(2.35) 

(2.36) 

(2.37) 


for natural n > A and z G [—VA, 0); for z = — vA, the expressions in ( 2.33| ) 
and (2.36) for Paa{Z^n-,m,s')x') and Ppo(Soo;m,g ;a:) a re defined by continuity, as 
e~^ and (1 — A/n)”, respectively; inequalities (|2.34|) and (|2.37|) follow by (|2.30|), 
([ 2 ^, d^T^ , and d^ . 

The exponential upper bounds (2.31) and (2.35) are the same (up to a shift, 
rescaling, and reflection x >-)■ —x) as Hoeffding’s bounds in [TH (2.1) and (2.3)], 
where they were obtained under an additional condition, which can be stated 
in terms of the present paper as 


P{Xi ^ ^) = 1 for all i Gl,n. 


(2.38) 


Note that (2.38), together with the conditions (2.12) and (2.14), implies the 
second inequalities in (2.1) and (2.2) with Si := —rrii. 
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For independent (but without the additional restriction ( 2.38[ )), the 
exponential upper bounds in ( |2.31 ) and (2.33) on P(«S'„ ^ x) — as well as the 
exact upper bound E/(^nm 2 /s] on E /(S'„) for /(x) = with /i < 0 — were 
essentially obtained in [38l Theorem 7]. Note two mistakes concerning the latter 
result: (i) in the proof in [38j, should be replaced by 'ip{hu) and (ii) what is 
presented as the proof of Theorem 7 in [35] is in fact that of Theorem 8 therein, 
and vice versa. Results of [35] seem yet relatively unknown, as the bound e~^ 
on P(S'„ ^ x) appeared later in [2T] . 

By [27| Theorem 3.11] or [28l Theorem 4], with Ca,o := r(a + l)(e/«)“j 


Paiv'^x) < Ca ,0 P(b < x) 


provided that the tail function x i—P(77 ^ x) is log-concave. Combining this 
result with the Cantelli inequality, one also has the following upper bound on 
P(S'„ ^ x): 

W{z) := min (^1, C 2 ,o P^Z ^ z)); 

note that C 2 ,o = e^/2 = 3.69.... This bound may serve as an easier to compute 
and deal with approximation to the better bound P 2 {m + Zy/s\x). 


log|0 C(z) '°El0 log,„ C(z) 



Fig 2. Decimal logarithms of the bounds/tails P{z), for A = 10 (first row) and A = 3 (second 
row). The columns correspond to n = 11 (left), n = 30 (middle), and n = oo (right). 

All the mentioned upper bounds P{z) := Paiv'j x) for rj equal or m -|- 

Z^/s can be fully expressed in terms of z, A, and n. These bounds are compared 
graphically in Figure]^ for A S {3,10}, a G {0,2,3,oo}, n G {ll,30,oo}, and 
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0 £ (—-x/A, O) ; note that P(S„.m,s ^ x) = Pa{^n-,m,s\ a;) = 0 if z < —y/\] here, as 
is natural, Pa(Xin-,m,s] x) is interpreted as the true tail probability P(Sn;m,s ^ x) 
for a = 0. The graphs of log^o Pa{^n;m,s', x) shown in Figure]^ are red: stepwise 
for a = 0, solid-continuous for a = 3, and dashed-continuous for a = oo. The 
graphs of logj^g Pa {m+Zy/s; x) are black: solid for a = 2, and dashed for a = oo. 
No graphs are shown for PaiZin^m^s^x) with a = 2, as those are not established 
bounds; nor is there a graph for Pa{m + Zy/s;x) with a = 3, as the better 
bound with a = 2 is available. Also, a graph for W{z) is shown, dotted-green. 

It is seen that the bound P 3 2 :) is close to the true tail probability 

P {Z‘n;m,8 ^ a;), especially for A = 10 and n = 11, with a zero error at the left 
end-point (—-x/A) of the range of each of the r.v. — 'm)/y/s, which is in 

accordance with part (iv)(b) of the mentioned [501 Proposition 3.2]. In the latter 
case (A = 10 and n = 11), the bound P-i{Zn-,m,s',x^ is over 8 times better near 
the left-end point of the range than the “normal” exponential bound 
However, P;i(J^n-,m.s', x) may be slightly greater for z near 0 than the “normal” 
better-than-exponential bound P 2 (m + Zy/s; xj ; this is due to the fact the class 
.^ 1 '^ is somewhat richer than 


3. Proofs 


Proof of Theorem 2-4 


(I) By a standard induction argument (cf. e.g. |29l Lemma 12]), in order to 
prove part (I) of the theorem, it is enough to show that \2.7\ holds for n = 1. 
Moreover, by [53 Corollary 5.9], we may assume that / = f ^^2 for some w G K, 
where /uj ,2 is defined by formula ( 2.18| ). So, the proof of part (1) will be complete 
once it is shown that 

E/„.2W < (3.1) 

whenever the r.v. X is nonnegative, EA ^ to, EA^ ^ s, w G M., and 0 < 


TO < y/s. For w < 0, both sides of (3.1) are zero. So, w.l.o.g. w > 0. Introduce 
now z V ■.= w y z, and c := and then g{x) := c^{v — x)"^. Then 

p^ym.s g {0,z}) = 1, fyj^2 < On [0, oo) and f^^2 = 9 on {0, z}, whence 
g(jjzm.,s^ _ 2 (A"^’®) almost surely (a.s.). Note also that ?; > 0 and recall the 

relations E A ^ to = E A"*’® and E A^ ^ s = E(A'"’®)^. Thus, 


E/^, 2 (^) Eg{X)=c^{v^-2vEX + EX^) 

< {v^ -2vE A”"’" -E E(A’"’"'2 

= E5(A™’*) = E/,,,2(r’"’*) 


)^) 


which completes the proof of part (1) of Theorem 2.4 
( 11 ) Take any / £ and consider 


A„j(Pi,...,P„) := +..-+A™"’^’'), 


the right-hand side of (2.7), where 


Pi := {mi, Si) 


(3.2) 
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for all i. Note that the function F^j is symmetric (with respect to all permuta¬ 
tions of its n arguments, Pi,, Pn)- Next, if nonnegative r.v.’s Xi ,..., sat¬ 
isfy the (m, s)-condition, they satisfy the (m, s)-condition for some mi,..., mn, 
si,..., s„ such that mi -I- • • • -I- m„ = m and si -I- • • • -I- s„ = s. So, by (2.7), to 


prove (2.8) it is enough to show that 


Fnj{Pl, • ■ ■ , Pn) < FnjiPn, • ■ • , Pn), 


(3.3) 


where P„ := ^(^*1 + • • • + Pn)- Here we shall need the following lemma, which 
establishes a Schur-concavity-like property of the symmetric function Pn,/. 

Lemma 3.1. For any natural n ^ 2 and any t S [0,1] 

Fnj{Pli ■ • ■ , Pn) ^ Fnj{Pi+tt P 2 -t, P3 • ■ • , Pn), 
where Pi+t := (1 — t)Pi -l- tP2 and hence P2-t = tPi + (1 — t)P2. 


The proof of Lemma [3.1 [ will be given at the end of this section. 

Note that P„,/ is a function of n points Pi,...,P„ in K^, rather than of 
n real arguments. If the latter were the case, then Lemma |3.1| together with 
the well-known Muirhead lemma (see e.g. m Lemma 2.B.1]) would imme¬ 
diately imply the Schur-concavity and hence (3.3). However, no appropriate 


“multidimensional” analogue of the Muirhead lemma seems to exist. Indeed, 
if one defines the “multivariate” majorization by means of doubly stochas¬ 
tic matrices (in accordance with the Hardy-Littlewood-Polya characterization 
— see e.g. [20l Theorem 2.B.2]), then the analogue of the Muirhead lemma 
fails to hold. For example, take n = 3 and consider the doubly stochastic 
3x3 matrices (say A and Pj, for some t G [0,1]) that transform any triple 


r := {Qi,Q 2 :Q 3 ) of points in to (say) f := ( 


_/ Q1+Q2 Q1+Q3 Q2 + Q3 


) and 


2 ’ 2 ’ 2 

Tt := ((1 — t)Qi + tQ 2 ,tQi + (1 ~ ^)Q 2 ,Q 3 ), respectively; matrices such as 
Bt are referred to as T-transform matrices, all of which latter can be written 
as C~^BtC for some t G [0,1] and some permutation matrix C; see e.g. [23 
Section 2.B]. Then, if the points (5i,Q2,Q3 are not collinear, already after one 
application of any matrix Bt with t G (0,1) to r one will never be able to get 
from Tt to T via any chain of P-transforms, since the points and 

do not belong to the convex hull of the set {{l — t)Qi+tQ 2 ,tQi + {^ — t)Q 2 ,Q 3 }- 
We shall verify (3.3) by induction on n. For n = 1, ( |3.3[ ) is trivial. Suppose 
that (3.3) holds for n equal some natural k, and consider n = k + 1. Introduce 
Pk ■=^Pk + (1 - ■^)Pfc+i, fk+i{x) := and gk+i{x) := 

E /(a:-|-F™''+i’®'=+^), where {fhk+i,Sk+i) ■= Pk+i- By Remark ! 
fk+i and gk+i are in Also, 


2.3 


the functions 


I {Pk + {k-l)Pk) =Pk+i. 


(3.4) 
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It follows that 


^fe+l,/(^’l, • ■ • ,-pfc+l) 

= ^PkJk+1 (^1) • ■ ■ J Pk) 

— Pk+l,f {Pk+l ; Pk Pk ) 


^ Fk + ljiPk, Pk+1, Pk, ■ ■ • , Pk) 

= E-Ffc,3i,^.i(Pfe,-P/c, ■■■,Pk) 

^ E {Pk+l, ■ ■ ■ , Pk + l) 

= Fk+ij{Pk+i, ■ ■ .,Pk+i) 


(by the definition of fk+i) 
(by induction) 

(by the definition of fk+i 
and the symmetry of ibc+i, 
(by Lemma 
(by the definition of gk+i) 
(by induction and ( |3.4[ )) 

(by the definition of gk+i)- 


3.1 


with t = ^ 


/) 

) 


This completes the proof of (2.81, modulo Lemma 13.1 


By an argument similar to that used in the proof of part (I) of Theorem 2.4 
it is enough to verify (2.9) and (2.10) for / = /u,, 3 , and (2.11) for / = /^_ 2 - 


In inequality (2.8) with n + 1 instead of n, take = 0 and Xi = " 

for i G l,n; it then follows that the right hand-side of (2.8) is nondecreasing in 


n, for any fixed positive real m and s. Next, (i) all the r.v.’s in (2.8) and (2.9) 
are nonnegative, (ii) the function /uj ,3 is continuous and bounded on 
(iii) Yj" -I- • • • -I- Yi" converges in distribution to ::!:n 


b, 




as n 


o), and 
oo. So, 


the right hand-side of (2.8) is, not only nondecreasing in n, but also converging 


to the right hand-side of (2.9) as n —>• oo (for / = fw, 3 )- Thus, (2.9) follows. 


As for inequality (2.10), it is essentially a special case of (2.11). Indeed, 
consider the latter inequality with n —>■ oo and Xi = x["'\ ..., A„ = being 
independent copies of c„nA„, where c„ := ~ ^ and A„ := ^^1^2 ~ 

Then the r.v.’s Xi,...,X„ satisfy the (m, s)-condition, and Sn converges to 


7 n„, 2 /g in distribution. Therefore 


E/^.2(5„)^E/„.2(^n^2/,). 


Thus, it remains to prove (2.11), for / = fw, 2 - If at that w ^ 0, then the left- 
hand side of (2.11) is zero, while its right-hand side is nonnegative. Therefore 


also in view of (2.7) and (2.4) 
it suffices to show that 


and by rescaling, w.l.o.g. w = 1. Also, as in the proof of part (I) of Theorem 2.4 
w.l.o.g. n = 1. Thus 
Theorem 12.41 


to complete the proof of 


S(m) := 6{m, k) := 


E (1 - TO - kmZY^ - E (1 - 
2(]Pm? + {m — 1 )^) 


^0 


for all TO G (0, oo) and k G (1, oo). Take indeed any k G (1, oo). Note that 


5'{m) k^ [k^m? + (to — 1)^)^ 


{D6)i{m) if TO G (0,1/fc^], 
{Dd) 2 (rn) if to G [1/A:^, oo). 
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where 

{D5)i{m) := k'^m{l — m) — ^), 

V km / 

{D5)2{m) := {k^ — l){k^m — 1 + m) — k^m'^^p(— —, 

V km / 

and If is the standard normal density function. Next, for m G (0,1/A:^] one has 
to(1 — to) > 0 and 


d 1 

/{DS)iim)\ 

k^ {k’^wf + (to — I)^) 

/TO-lX 

dm 

\to(I — to) / 

^ (I — to)^to^ 

\ km / 


so, {D 6 )i — and hence 6 ' — may change in sign on the interval (0,1/fc^] at most 
once, and only from + to —. Similarly, for to S (1/fc^, oo) one has k^m— 1 + to > 
k^m — 1 > 0 and 


d 1 

/ (D5)2{m) \ 

k^ {kl^m — l) {k’^mf + (to — I)^) 

/TO — 1\ 

dm 

\kfm — I + TO/ 

TO {kfm — I + to)^ 

\ km ) 


so, {D 6)2 — and hence <5' — may change in sign on the interval [1/fc^, oo) at most 
once, and only from + to —. Thus, S' may change in sign on the interval (0, oo) 
at most once, and only from + to —. It follows that 6 {m) ^ <5(0+) A S(oo—) for 
all TO € (0,oo). So, to complete the proof of Theorem |2.4[ it remains to check 
that ^(0+) A (5(oo—) ^ 0. In fact, one can see that <5(0+) = 0 and 


2S(oo-) = q(t) := P(Z > t) - > 0, (3.5) 

with t := 1/k > 0. The inequality in (3.5) is well known; see e.g. [12 (19) for 
()) 2 ]; alternatively, it follows because q'(t) = — < 0 q{oo—) = 0. This 

I_I I_I 

completes the entire proof of Theorem |2.4[ modulo Lemma |3.1| □ 


Proof of Lemma \3.1\ W.l.o.g. n = 2 — cf. e.g. the first equality in the big 
display following (3.4|. Also, by the symmetry under permutations, w.l.o.g. t G 


[0,^]. Moreover, w.l.o.g. t here and elsewhere we are using (sometimes 

tacitly) a version of continuity relevant in a given context. So, it suffices to show 
that G'[t) ^ 0 for alH G [0, ^), where 


G{f) := Gp„p,(t) := F 2 j{Pi+u P 2 -t). 


(3.6) 


Actually, it is enough to show that 


C?) 

G'(0) ^ 0, 


(3.7) 


because for any t G [0, ^) and s := one has Pi+t = (1 —s)Pi+t+sP 2 -t and 
P 2 -t = sPi+r + {l-s)P 2 -r, whence Gp^^p^{t) = Gp^+^^p^.^( s) and G'p^ p^{t) = 
G'p^^^ p^_^(0)/(I—2 t). Next — cf. the proof of part (I) of Theorem 2.4 — w.l.o.g. 

and all x G M. Thus, 


f{x) = {w — x)p for some w G 


G{t) = E(W - _ ym2-t.S2-t)3 


+ ’ 
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where {rriu, s„) := for any u. li w ^ 0 then G{t) = 0 for all t, so that 
there is nothing to prove. Therefore, by rescaling, w.l.o.g. w = 1. So, in view of 
Definition |2.2[ G{t) can be expressed in terms of the variables t, a, p, b, q only, 
where 

a:= — >0, 6:=^>0, p:=^G(0,l], g:=^e(0,l]. (3.8) 

mi TO2 Si S2 

By the symmetry relation Gp^^p^it) = Gp^^p^{t) (and continuity), w.l.o.g. 0 < 
b < a, so that 0<6<a<a + 6 . Thus, it suffices to consider the following four 
cases: 


(Co) 1 G (a + 6 , oo); 

(Cl) 1 G (a, 0 + 6 ); 

(C 2 ) 1 G (6, o); 

(C 3 ) IG ( 0 , 6 ); 

at that, with each case it is assumed 0 < 6 < a and 0 < p, g < 1. In each of the 
cases (Cfc) with fcG {0,1,2,3}, the expression 


Dk := AkG'(O), 

is a polynomial in o, 6 ,p, q, where 

^0 := Ij Ai := As := a^b^, and A2 := o^. 


(3.9) 


(3.10) 


Therefore, to finish the proof of inequality (3.7) and thus that of Lemma |3.1[ it 
remains to verify the following lemma. □ 

Lemma 3.2. In each of the cases {Gk) with k G {0,1, 2,3}, the polynomial Dk 


in a,b,p,q, defined by (3.9) and (3.10), is nonnegative for all p and q in (0,1). 


Proof of Lemma \3.^ For each k, Dk is a polynomial and the conditions that 
define the case Gk are polynomial (in fact, affine) inequalities. So, the verifica¬ 
tion that Dk is nonnegative in each of the cases Gk can be done in a completely 
algorithmic manner, due to the well-known Tarski theory [151 [T51 [7] . This the¬ 
ory is implemented in Mathematica via Reduce and other related commands. 
Thus, the Mathematica command Reduce [derO < 0 && caseO] (where derO 
and caseO stand for Dq and [(Co) & 0 < 6 < a & p G (0,1) & 9 G (0,1)], 
respectively) outputs False (in about 0.3 sec on a standard desktop), which 
means that indeed Dq ^ 0 in the case (Co). Cases (Ci), (C 2 ), and (C 3 ) can be 
treated quite similarly, with Mathematica execution times of about 5.4 sec, 0.65 
sec, and 0.04 sec, respectively. Details of the corresponding calculations can be 
found in the Mathematica notebook solution-tarsky.nb and its pdf copy solution- 
tarsky.pdf in the folder Mathematica in the zip file LeftTailBounds.zip posted 
at the SelectedWorks site works.depress. com/iosif-pinelis/7/download/. 
The symbols derO, ..., der3 in our Mathematica notebooks correspond to 


Do ,... ,Ds defined by formulas (3.9)-(3.10) in the paper. 

This completes the proof of Lemma 73.2[ which appears no less reliable than 
computations done “by hand”; cf. e.g. the views of Okounkov page 35], 
Voevodsky m, and Odlyzko [25] on computer-assisted proofs. □ 
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However, as Okounkov [22j notes in his interview, “perhaps we should not 
be dependent on commercial software here”. Indeed, details of the execution of 
the Mathematica command Reduce [] are not open to examination. Therefore, 
in addition to the above proof, in each of the next two sections an alternative 
proof of Lemma |3.2| is provided. 

The proof of Lemma |3.2| given in Section relies, instead of the Mathe¬ 
matica command Reduce, on the Redlog package of the computer algebra sys¬ 
tem Reduce; both Reduce and Redlog are open-source and freely distributed 
(http://WWW.redlog.eu/). 

The proof of Lemma [3^ given in Sectionj^uses only standard tools of calculus 
and also such a standard tool of algebra as the resultant, available in a number 
of open-source computer algebra software packages. 

Recall that, for each k S {0,1, 2,3}, Dk is a polynomial in a, b,p, q. For each 
k G { 0 , 1 , 2 , 3}, in the case (Ck), the quadruple (a, b,p, q) belongs to the set 

■— ^ ( 0 ; 1) j 

where 

Wo :={(a, &)GK^: 0 < 6 <a<a-|- 6 <l} 

= {(a, 6) G : 6 < a < a -I- 6 < 1}, 
wi :={(a, &)GK^: 0 < 6 <a<l<a-|- 6 } 

= {(a, 6 ) G : 6 < a < 1 < a -I- 5}, 

W 2 :={(a, &) G : 0 < 6 < 1 < a}, 

W 3 :={(a, &) G : 1 < 6 < a}. 

For each k G {0,1,2,3}, let w^ denote the topological closure of Wfc, so that Wfe 
is defined by the system of non-strict inequalities corresponding to the strict 
inequalities defining the set ujk- 

We shall use notation such as the following: 

Dk-,p=S ■ , 5 ’ ^k-,q=e ■ ^k\p=S,q=e ■ ^k\p=S,q=E^ (3.13) 

sometimes in such notation we shall use, instead of Dk, a modified version Dk 
of Dk, which differs from Dk by a factor which is manifestly positive in the 
corresponding context. 

In files pertaining to the mentioned package Redlog and in 
subsequently used Mathematica notebooks, we shall use 
notations such as derO,..., der3, derOpO,..., derSplql 
for Do,...,p 3 ,Do.p^o,...,D 3 .p^i^g^i, respectively, or 
possibly for D. in place of D.. 


(3.11) 


(3.12) 


4. First alternative proof of Lemma |3.2| 

Unfortunately, for polynomials in several variables the mentioned package Red¬ 
log is either much slower than Mathematica (as in the cases of the polynomials 
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Dq and £>3 in ( |3.9| )) or unable to complete the desired verification of the nonneg¬ 
ativity (as in the cases of the polynomials Di and in ( |3.9[ )). I have also tried 
another well-known open-source program, QEPCAD B (Quantifier Elimination 
by Partial Cylindrical Algebraic Decomposition, Version B), but it crashes even 
where Redlog eventually produces the result. 

More specifically, Redlog verifies the nonnegativity of the polynomials Dq 
and £>3 (in cases (Cq) and (C 3 )) in about 107 min and 0.45 sec, respectively; 
details on this can be found in the .log files der0.log and der3.log and in the 
corresponding .png files derO.png and derS.png. 

The .log and .png files mentioned in this section are in the folder 
Reduce (Redlog) in the zip file LeftTailBounds.zip at the SelectedWorks 
site works.depress.com/iosif-pinelis/7/download/. 

These execution times, 107 min and 0.45 sec, may be compared with the 
corresponding ones for Mathematica, mentioned in the proof of Lemma |3.2| in 
the preceding section: 0.3 sec and 0.04 sec). 

To verify the nonnegativity of the polynomials Di and D 2 with Redlog, each 
of these two verification problems has to be reduced, by a human, to a series 
(or rather a tree) of simpler problems, as presented below. 

Lemma 4.1. In the case (Ci), the polynomial Di in a, b,p, q is nonnegative for 
all p and q in (0,1) - that is, Di ^ 0 for all (a, b,p, q) G fli- 

Proof. Assume indeed in this proof that (a, b,p, q) G fli, unless otherwise stated. 
One has 

Di = of’b^p - 2a%^p + a^b'^p + aV - 3aV + 3aV - a'^p'^ 

- 2aHp'^ + &a%p^ - &a%p^ + 2a%p^ + Sa^b'^p^ - Sa^^p"^ 

+ 2a^b^p^ — a^b'^p^ + a^b^q — 2a^b'^q + a^b^q + 2a^b^pq 
- ea^b'^pq + 6a%^pq - 2af'b^pq - 6a^b^pq + QaH^pq - 2a%^pq 
- 2ab^q^ + ?,a^b^q^ - a%^q^ + b'^q^ + Qab'^q^ - ?,a^b‘^q^ + 2a^b‘^q^ 

- Zb^q^ - &ab^q^ + Zb^q^ + 2ab^q^ - b\'^. (4.1) 


Consider 


dlD^dlDi-{d^d,Dif _ 

4a^(a —6)^6^ ^ 

:= —2 -I- 9a — 15a^ -I- lOa^ — 3a® -I- a® -I- 96 — 33a6 -I- 48a^6 — 36a®6 -I- 15a"^6 
- Za% - 156^ -b 48a62 - + 240^6^ - Za^"^ -b 106® - 36a6® -b 2AaH^ 

- 7a^b^ + 15a6‘^ - Za'^b^ - 36® - 3a6® -b 6®; (4.2) 


here and in the sequel, da denotes, as usual, the partial differentiation in a. 
Using the mentioned package Redlog, we see that deti < 0 on wi; this takes 
about 0.5 sec; see details in the files derldet.log and derldet.png. 
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Hence, the determinant of the Hessian matrix of with respect to p and q is 
negative for all (a, q) G ffi. It follows that Di is saddle-like in p and q, and 
so, for each hxed (a, b) G wi, the minimum of the polynomial Di in (p, q) G [0,1]^ 
is not attained at any point {p,q) G (0,1)^; therefore, this minimum is attained 
at some point (p, (?) on the boundary of the unit square [ 0 , 1 ]^. 

Consider then each of the four boundary subcases of Case 1: p = 0, p = 1, 
q = 0, and q = 1. Using Redlog, we see that Di-p^o ^ 0 for (a, b, g) G x (0,1) 
(execution time « 0.25 sec; details in the files derlp0.log and derlpO.png) and 
Di-q=o ^ 0 for {a,b,p) G wi x (0,1) (execution time « 0.25 sec; details in the 
files derlq0.log and derlqO.png). 

The subcases p = 1 and < 7=1 require more care. Recall notation (3.13). 

To consider the subcase p = 1, assume that (a, b) G uji and q G (0,1). In view 
of (pi), 


Di-p^i = a^(a — 3a^ -I- 3a^ — a"^ — 26 -|- Gab — Ga^b + 2a^b + 36^ — 3a6^ -I- a^b^) 
+ a^b^{2 — Ga + Ga^ — 2a? — Gh a?b -\- Gb^ — 2ab^ — b^)q 

— b^(2a — ia? -|- — 6 — Gab + 3a^b — 2a?b -\- 36^ -I- Gab^ — 36^ — 2ab^ + b‘^)q^■ 

(4.3) 


Using Redlog, we see (in about 0.16 sec) that 

■= " - &)(1 - bf + a\2b - a) ^ 0 (4.4) 

and (in about 0.8 sec) that 


nOOl ._ 

^l;p=l,q=l ■ — 

= 0^(6 - 6 - 2 a) - 2(1 - hfb{2a - b) - 20^(3 - 6 ^) -f 0^(2 - 6 ^) 0 

(4.5) 

(details in the hies derlpl.log and derlpl.png; notations D002derlpl and 
DOOlderlplql there correspond to D^^p^i and respectively). So, 

Di-p=i is convex and decreasing in q. At that, 

^i;p=i|,=i = (a-6)2[a2(l-a)3 + (l-6)V] >0. (4.6) 




We conclude that indeed D 


l;p=l 


> 0 . 


To complete the proof of Lemma 4.1 it remains to consider the subcase < 7 = 1 . 
Expanding Di-q^i in powers of p, one has 

Di.q=i = ipip) := Ap^ -k Hp -h C, 


where 


A := —a^{a* — 2a?b — 3a^ -I- Ga^b + 3a^ — 2ab^ + 3ab^ — Gab — a + b'^ 

-3b'^ + 2b), (4.7) 

B ■= -a^b^ia^ + 2a% - Ga^ - ab^ + Ga + 26^ - 66^ -k 66 - 2), 

C := b^{a^b'^ — 3a?b + 3a? + 2ab^ — Gab^ + Gab — 2a — b'^ + 36^ — 36^ -|- 6). 
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Let 


, - A.AC 

=8 - 36a + 60a^ - 44a^ + 12a^ - 366 + 132a6 - 180a^6 + 108a^6 - 24a'‘6 
+ a% + 606^ - 192a6^ + 192a^6^ - 840^6^ + lOa‘^ 6 ^ - 406^ + 144a6^ 

- 840^6^ + 210^63 - 60a6^ + 120 ^ 6 ^ + 126^ + 12a6® - 46®, 

di '■= ^ = 2 — 6a + 6a^ — a® — 66 — 2a^b + 66 ^ + a 6 ^ — 26®, 

d 2 := jf = 2 a — 6 a^ + 6 a® — 2a‘*^ — 26 + 6ab — 6a^b + 2 a ®6 

a^{a — b) a^(a — 6 ) 

+ 66 ^ - 6 a 6 ^ + a^b^ - 66 ® + 3a6® + 26^. 


Note that discr equals in sign the discriminant of the quadratic polynomial 
ipip)- Therefore, discr > 0 if and only if 'tjj{p) takes both positive and negative 
values as p varies from —oo to oo. Using Redlog, we see that (i) A ^ 0 (« 0.16 
sec execution time); (ii) the conjunction of the conditions discr > 0 , di < 0 , 
and 6 < 1/2 never takes place over the set wi (« 3.1 sec execution time); 
and (iii) the conjunction of the conditions discr > 0 , ^2 > 0 , and 6 > 1/2 
never takes place over the set wi (« 25.5 min execution time); details are in 
the files derlql.log, derlql-top.png (for the first 10 Redlog commands), and 
derlql-bottom.png (for the last 10 Redlog commands); in those files, AA stands 
for A/a®, with A as in (4.7). So, over the set wi one has the following: (i’) 
the function ip is convex; (ii’) if 6 < 1/2 and ip changes sign over K, then 
ip'{0) > 0 and hence ip{p) is nondecreasing in p S [ 0 , 1 ]; and (hi’) if 6 > 1/2 
and Ip changes sign over M, then ' 0 ^( 1 ) ^ 0 and hence ip{p) is nonincreasing in 
p € [0,1]. Thus, in view of the continuity of -Di;g=i = ip{p) in 6 , it remains to 
verify that Di-q=i^p=Q := '0(O)/6® = C/ 6 ® and = '0(l)/(a — 6 )^ = 

{A-\- B + C)/{a — 6 )^ are both nonnegative (over oji). For Di;q^i^p^o this is 
checked by Redlog in about 0.05 sec (details in files derlql.log and derlql- 
bottom.png), whereas Di-q^i^p^i = a^(l — a)®-f 6^(1 — 6 )® is manifestly positive 
(over uji). 

This completes the proof of Lemma |4.1[ □ 


Lemma 4.2. In the case (C 2 ), the polynomial D 2 in a, b,p, q is nonnegative for 
all p and q in (0,1) - that is, D 2 ^ 0 for all (a, b,p, q) G fli- 


Proof. Assume indeed in this proof that (a, b,p, q) G U 2 , unless otherwise stated. 
One has 


D 2 = a^p—3a®p-|-3a^p—2a^6p-|-a®6^p+3a®p^ —3ay-|-2a'^6p^ —a®6^p^ —2a6g 
+iafbq-Gb'^q-?,afb'^q+a^b^q-6a^bpq+6a^b^pq-2a^b^pq+6ab'^q^-ih^q^-6ab^q^ 

pib'^q^ + 2ab^q^ -b^q^. (4.8) 
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Using Redlog (details in the files der2.log, der2-top.png, and der2-bottom.png), 
we see that 


i = p — 9ap+18a^p—12a^bp + 3ab‘^p + 9ap^ — ISa^p^ + \2a?hp^ — 3ab^p^ 
+ 3bq - 3b^q + b^q - 6bpq + Qb'^pq - 2b^pq ^ 0 (4.9) 

on ^2 (execution time « 22.6 min) - so that D 2 is convex in a, 

D 2 -a=i = p—2bp+b^p+2bp‘^ — b^p‘^+bq—2b‘^q+b^q—Qbpq+6b‘^pq—2b^pq+6b‘^q‘^ 

- 9b^q^ + - h^q^ ^ 0 (4.10) 

for b,p,q in ( 0 , 1 ) (execution time « 1.2 sec), and 


daD2 


I a—1 


= 5p-8bp+3b^p-3p^+8bp^-3b^p^+Abq-6b^q+2b^q-12bpq+12b^pq 
- ib^pq + db'^q^ - Qb^q^ + 2b‘^q‘^ > 0 (4.11) 


for b,p, q in (0,1) (execution time « 1.2 sec); the symbols der2DDa, der2al, and 
Dder2al in the mentioned Redlog files stand for | d^D 2 , £* 2 ;a=i, and daD 2 \^^-^, 
respectively. To complete the proof of Lemma |4.2[ it remains to recall the defi¬ 
nition (3.11). □ 


Lemma 3.2 follows immediately from Lemmas |4. 1 1 and 4.2 and the nonnega¬ 
tivity of Dq and ZI 3 , mentioned in the beginning of this section. 


5. Second alternative proof of Lemma 3.2 


First here, let us briefly describe how to use the resultant tool in problems of 
polynomial optimization. Let K[xi,... ,a;„] denote the ring of all polynomials 
in indeterminates a;i,..., a;„ over a field RT; see e.g. |45) for the algebraic termi¬ 
nology used in this description. 

In the case when n = 1, this ring is written as Ri[a;], the ring of all polynomials 

in X over K. Suppose now that f{x) = oocc" H-h a„ and g(x) = 60 a;” + ■ ■ ■ + bn 

are two polynomials in K[x]. The resultant R{f,g) = RK{f{x), g{x)) of these 
two polynomials is the determinant of the (n -I- m) x {n + m) Sylvester matrix 
^ m} -I- here it is assumed that = 0 if 

k ^ 0,n and = 0 if £ ^ 0, m. Thus, i?(/, g) is a homogeneous polynomial of 
degree n -I- to in oq, ..., a„, 60 ,..., 6 rn- The Mathematica notebook resultant.nb 
and its pdf copy resultant.pdf show the Sylvester matrix for n = 3 and to = 4. 

The .nb files and their .pdf copies mentioned in this section are in the folder 
Mathematica in the zip file LeftTailBounds.zip at the SelectedWorks site 
works.depress.com/iosif-pinelis/7/download/ 

The remarkable property of the resultant is that R{f,g) = 0 if and only if 
ttQ = bo = 0 or the polynomials f{x) and g{x) have a common root, possibly in 
an algebraically closed field C containing the field K; moreover, 

R{f,9) = A{x)f{x) + B{x)g{x) 


(5.1) 
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for some polynomials A{x) and B{x) in K\x\^ whose coefficients are polynomials 
over Z in oo,..., an, bo, ■ ■ ■, bm', see e.g. [ISl Section 5.8]. 

Take now any natural n and let f{xi, ..., x„) and g{xi, ..., x„) be any poly¬ 
nomials in K[xi,... ,Xn\- These polynomials may be identified with the corre¬ 
sponding polynomials f(xi,..., Xn-i)(xn) and g(xi,..., Xn-i)(xn) in the ring 
-K(xi, ... , Xn-i)[xn] of all polynomials in the single indeterminate Xn over the 
field K{xi, ... ,Xn-i) of all rational functions in indeterminates xi, ... ,Xn-i 
over the field K. Thus, one has the resultant 


Rx„if,g){xi, ■ ■ ■,Xn-l) 

•— (^f {x I, . . . , Xn—l} (Xn), g{x\, . . . , Xn—l) (Xn)) 

of the polynomials f{xi,..., Xn) and g{xi,..., Xn) with respect to indeterminate 
Xn- Clearly, R,^^{f,g){xi,...,Xn-i) € K[xi,... ,Xn-i]- Moreover, by ( [^ , 

RxSf,g){xi, ■ ■ ■,Xn-l) = A{xi,. . .,x„)f{xi,. . .,Xn)+B{xi,. . .,Xn)gixi, . . . , X„) 

(5.2) 

for some polynomials A{xi ,..., x„) and B{xi,..., Xn) in K[xi ,..., Xn]- So, if 
the polynomials f(xi,..., x„) and g{xi,..., x„) have a common root (oi,..., a„) S 
C", then (/, g){xi,..., Xn-i) has a root - namely, (ai,..., a„_i) - in C”“^. 
Consider now a system of n polynomial equations 


fj{xi, ...,Xn) =0 ioT all j e 1, n, 

in the n indeterminates xi,... ,Xn, where fj{xi,... ,Xn) G K[xi,... ,Xn] for 
each j. If this system has a root (ai,..., a„) G C”, then the reduced system of 
the n — \ polynomial equations 


- = 0 for j G 1, u - 1, (5.3) 

in the n — 1 indeterminates xi,..., Xn-i, has a root - namely, (oi,..., Q!„_i) 
- in C'^~^ . Thus, the indeterminate x„ has been eliminated. Continuing in 
this manner, one arrives at one equation of the form Fi{xi) = 0, for some 
polynomial T’i(xi) G K[xi]. Quite similarly one obtains polynomial equations 
Fj{xj) = 0 for each j G 2,n. In the “nondegenerate” case - when all the 
polynomials Fi(xi),... ,Fn(Xn) are nonzero - each of the resulting equations 
Fi(xi) = 0,..., Fn(xn) = 0 has only finitely many roots (in C and hence in K). 
One can then check which of the finitely many n-tuples of those roots are roots 
of the original system fj{xi,... ,Xn) = 0 for all j = I,...,n. Thus, one can 
see that, at least in the “nondegenerate” case, resultants can be used to solve 
systems of polynomial equations by successive elimination, somewhat similarly 
to solving systems of linear equations. 

In the remaining, “degenerate” case, other, more computationally intensive 
tools of algebraic geometry need to be used, such as the calculation of a Grobner 
basis, which, in particular, allows one to determine the dimension of an algebraic 
variety; see e.g. |H1 Ch. 9]. One may hope, though, that such a degeneracy is 
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unlikely to occur in a particular problem. Also, if a degeneracy indeed occurs, 
one may turn to using other methods, say ones of calculus if the field K is M. 

Suppose now, in the case when (say) K = M., one wants to show, as we 
do in the proof of Lemma 3.2 that a polynomial /(xi,... ,x„) is nonnegative 
everywhere on a (say) compact subset of M" of the form 


n := {(xi,... ,x„) e M”: gi{xi ,... ,x„) ^ 0 Vi € 1 ,to}, 

for some natural m and some nonzero polynomials ^^(xi,..., x„) in 
K[xi,... ,x„]. First here, if the minimum of /(xi,... ,x„) over (xi,... ,x„) € fl 
is attained at an interior point (ai,... ,a„) of fl, then this point is critical for 
/, that is, the partial derivatives, say /i,..., /„, of the function / respectively 
in xi,..., x„ vanish at the point {ai ,..., an)- So, one can use the resultants, as 
described above, to obtain a finite set containing all the critical points of / in 
the interior set 


int n = {(xi,..., x„) £ : gi(xi,..., x„) > 0 Vi £ 1, m} 

of n - provided that the non-degeneracy holds. Similarly, for any set J C l,m, 
one can try to obtain a finite set containing all the critical points of / in the 
interior set of the (in general curved) J-face 

{ (Xl, . . . , Xn) £ (xi, . . . , Xn) — 0 Vi £ V} 


of the set fl - by considering the polynomial system of equations involving the 
Lagrange multipliers; see e.g. [321 page 434]. (if the faces of are subsets of affine 
subspaces of K" - as they are in the proof of Lemma |3.2[ then the elimination 
of variables and hence the minimization of / over the faces are much simpler.) 
Thus, unless a degeneracy is encountered, one reduces the verification of the 
nonnegativity of the polynomial function / on the set 12 to that on a finite set. 

One can also try to use some of the various Positivstellensatze of real algebraic 
geometry (see e.g. [HllITl 11111113]), which can provide a so-called certificate of 
positivity to a polynomial that is indeed positive on a set defined by a system of 
polynomial inequalities (over M); that is, by an appropriate Positivstellensatz, 
the positive polynomial can be represented as a polynomial (with positive co¬ 
efficients) in simpler polynomials that are manifestly positive on the given set. 
This method was used successfully in [33]. However, it does not appear to be 
very effective in the proof of of Lemma 3.2 and will be used only little there. 

One can also use a combination of all these and/or other methods. In fact, 
we shall try to avoid, as much as we can, using resultants or other algebraic 
tools. Instead, we shall try to use, as much as possible, calculus tools such as 
monotonicity and concavity/convexity, which are oftentimes much more efficient 
in eliminating variables. 

Apparently any proof of Lemma |3.2| will involve a very large amount of al¬ 
gebraic and arithmetic calculations; in particular, note the execution time of of 
about 5.4 sec mentioned in the above proof of Lemma |3.2[ based on Tarski’s 
theory. Of course, for the proof to be valid, any arithmetic calculation needed 
therein must be carried out in an exact arithmetic. 
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Let us now get specifically to the proof of Lemma [3^ Recall definitions (3.11 1 
and (3.12). 

The following lemma eliminates one of the variables p, q. 


Lemma 5.1. For each k € {0,1, 2, 3} and for each fixed (a, b) S oJk, the min¬ 
imum of the polynomial Dk in (p,q) S [ 0 , 1 ]^ is not attained at any point 
(PjQ) G ( 0 ) 1 )^/ hence, this minimum is attained at some point {p,q) on the 
boundary of the unit square [ 0 , 1 ]^. 

Proof of Lemma \5.1\ Details of calculations in this proof can be found in Math- 
ematica notebook noInnerExtrIn_pq.nb and its pdf copy noInnerExtrIn_pq.pdf. 
One has 


Do = a^p — 2ofbp + ab'^p + dta^p^ — Qaf’p^ + a'^bq — 2ab‘^q + b^q 

— \2abpq + (Safbpq + Qab^pq + Qb^q^ — 6b^q^. (5.4) 

Note that Dq is saddle-like in p and q, that is, the determinant 

dpDo dgDo - {dpdqDoY = -36a^(a - 6)^6^ 

of the Hessian matrix of Dq with respect to p and q is negative for all (a, b,p, q) G 
Hq. Hence, the case fc = 0 of Lemma [5T] follows. 


The other cases of Lemma 5.1 are similar, except that the case k = 1 is 
more complicated than the rest. Let us defer this case to the end of the proof 
of Lemma [O] and consider the cases k = 2 and fc = 3 next. 


Concerning fc = 2, recall (4.8). One has 

dlD2dlD2-{dpdqD2f 

= -40^(0 - b)b^{i - 36 -f 62)[(1 - 6)(3 - 6 ) -b (a - 1)(6 - 46)] < 0 


for all (a, b,p, q) G 0,2- Hence, the case fc = 2 of Lemma 5.1 follows. 
Concerning fc = 3, one has 


Do = a'^p — 2a^bp a^b^p — a'^p^ -\- 2a^bp^ -j- a^b^q 

— 2ab^q b'^q — 2afb^pq -\- 2ab^q^ — b'^q^ (5.5) 


and 


dpDo dqDo - {dpdqDoY = -8a'^(a - byb'^ < 0 


for all (a, b,p, q) G flo- Hence, the case fc = 3 of Lemma 5.1 follows. 

It remains to consider the case fc = 1; further details on this, more difficult 
the files noInnerExtrIn_pq-scratch.nb and 


case 


are 


given 


noInnerExtrIn_pq-scratch.pdf. Recall (4.1) and (4.2). It suffices to show that 
deti < 0 for all {a,b,p,q) G Hi, that is, for all (a, 6 ) G uJi - since deti does not 
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depend on p or q. To that end, we shall first show that the maximum of deti 
over oji is not attained at any point (a, h) G wi. Indeed, otherwise 

Ai^odeti :=| cla deti 

=3 - 10a + lOa^ - 5a^ + 2o® - 116 + 32a6 - 360^6 + 200^6 

- ha% + 166^ - 36a6^ + 24a^6^ - 4a^6^ - 126^ + 16a6^ 

- + 56^ - 2ah'^ - 


and 


Aop deti :=^ dt deti 

=3 - 11a + Iha^ - 12a3 + 5a^ - a® - 106 + 32a6 - 36a26 
+ 16a36 - 2a% + 106^ - 3606 ^ + 24a^62 - + 20ab^ 

- Aa^b^ - 5b* - 5ab* + 26® 

would both vanish at that point (a, 6 ), and then so would the resultant 
Rbi^ip deti, Ao,i deti)(a) 

= - 243(1 - afa^i-S + 21a - SSa^ + 84a3 - 60a^ + 17a®) 

X (3168 - 13325a + 236450^ - 22308a® + llOlGa"^ - 3456a® + 432a®) 

of the polynomials Ai^o deti and Agp deti with respect to 6 . However, by 
Sturm’s theorem (say), this resultant (which is a polynomial in one variable, 
a) does not have roots in the interval (i, 1 ), whereas the condition (a, 6 ) G Wi 
implies a G (|, !)• This contradiction completes the verification that the maxi¬ 
mum of deti over wi is not attained at any point of toi. 

Consider finally the values of deti on the boundary of wi. One has 

deti L =-2 + 18a - 630^ 116a® - 126a'‘ -f 72a® - 17a® < 0 

^ I b—a 

for a G ( 5 ,1); 

deti|^^^ =6®(-9-hl26- 662-p6®) <0 for 6 G (0,1); 
deti \a-i-b ~ “9(1 “ <0 for 6 G (0,1). 

Thus, deti is no greater than 0 on the boundary of wi and does not attain 
its maximum over d)i at any point (a,6) G wi. It follows that deti < 0 for all 


follows as well. □ 

In view of Lemma |5.1[ it remains to verify, for each k G {0,1,2, 3}, that 
Dk ^ 0 for all (a, 6 ) G Wfc and all {p, q) G [0,1]^ such that either p G (0,1} or 
q G {0,1}. Thus, for each k G {0,1,2,3}, one has to consider 4 possibilities: 
p = 0, p = 1, q = 0, and q = 1, which results in 4 x 4 = 16 subcases. These 16 
subcases will each be considered in one of the corresponding 16 lemmas below; 
some of these lemmas are very simple, and some are rather complicated. Recall 


(a, 6 ) G Wi, that is, for all (a, b,p, q) G fli. Hence, the case fc = 1 of Lemma 5.1 
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notation (3.13). Details of calculations in the proofs of these 16 lemmas can be 
found in the corresponding Mathematica notebooks and their pdf copies. The 
names derOpO.nb (der0p0.pdf), ..., derSql.nb (der3ql.pdf) of these notebooks 
(and pdf files) correspond to the polynomials Dg-^p^Q, ..., whose non¬ 

negativity is stated and proved in these lemmas. E.g., the “root” derOpO of the 
names of the files derOpO.nb and der0p0.pdf with extensions .nb and .pdf is ob¬ 
tained from the symbol Dg-p-g by replacing there D by der and removing 
and “=”. 

Lemma 5.2. Dg.p^g ^ 0 for all {a, b) G ujg and q G (0,1). 

Proof of Lemma\5^ In view of ( |5.4[ ), Dg.p^g = bq[{a — b)^ -I- 6bq{l — 6)], which 
is manifestly nonnegative for all (a, b) G uig and g G [0,1]. □ 

Lemma 5.3. Dg.p^i ^ 0 for all {a, b) G wg and q G (0,1). 


Proof of Lemma 5.3 Assume indeed in this proof that (a, b) G ujg and q G (0,1), 
unless otherwise indicated. In view of (5.4), 


Do;p=i = bq{7a^ + Aab — 12a + r) — a{5a^ + 2ab — 6a — r) 6(1 — b)b‘‘q . 

So, dqDg.p=i = 12 ( 1 — 6 ) 6 ^ > 0, whence Dg.p^i is convex in q. Next, Dg.p=i^q=i = 
(a — 6)^[6 — 5(a-I-6 )] > 0 and {dqDg.p=i)\^^.^ = (^~ 6 ) 6 [ 46 - l - 7(a-|-6 ) — 12 ] < 0, 
since a -I- 6 < 1 . Thus, Lemma [ 573 ] follows. □ 


Lemma 5.4. Dg.q^g ^ 0 for all {a,b) G ujg and p G (0,1). 


Proof of Lemma \5.4\ In view of (5.4), Zlo;g=o = ^^[(a ~ + 6ap(l — a)], which 

is manifestly nonnegative for all (a, b) G uig and p G [0,1]. □ 

Lemma 5.5. Dg;q=i ^ 0 for all {a,b) G ujg and p G (0,1). 

Proof of Lemma\5.5\ Assume indeed in this proof that (a, b) G ujg and p G (0,1), 


unless otherwise indicated. In view of (5.4), 


Dg.q^i = a^b + 66^ — 2a5^ — 56'^ -|- a'^p — 12abp + Aa'^bp + 7ab‘‘p + 6a^p‘‘ — 6a^p^. 


If Dg.q^i has a local extremum at some point (a, b,p) G uig x (0,1), then at this 
point 

Dg^q=i ■=daDg.q^i = 2ab — 2b^ + Zafp — 12bp + 8abp + 7b^p + 12ap^ 

- 18aV = 0, 

Dg]q=i '■=dbDg.q=i = of + 126 — 4a6 — 156^ — 12ap -I- 4a^p -I- 14a6p = 0, 

:=i dpDg.^q=i =a^ - 126 -b 4a6 -b 76^ -b 12ap - 12a^p = 0 

(these three partial derivatives are denoted by DlOOderOql, DOlOderOql, 
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DOOlderOql, respectively, in the files derOql.nb and derOql.pdf). So, (cf. ( |5.3[ )), 
■ 6a2(a-6) 

= - 4a^ + 30^ + 966 - 148a6 + 67a^b - lUb"^ + 4906^ + 496^ = 0, 

res°“ := ^ = _i2a + Sa^ + 366 - ab - 496^ = 0, 

2a(a — 6) 

res :=i?a(resi°\ res°“)(6) = 8646(76 - 4)(76 - 1)2(31562 - 1686 - 16) = 0. 

The latter equation, res = 0, together with the condition (a, 6) G wq (which 
implies 0 < 6 < ^), yields 6 = ^. However, res^°^ lb- 1/7 ~ f (1 —a)2(27+7a) ^ 0. 
So, -Do;(j=i does not have a local extremum at any point (a, b,p) G wq x (0,1). 

It remains to show that -Do;(j=i ^ 0 for all {a,b,p) G (Do x [0,1] such that 
p G {0,1} or (a, 6) is on the boundary of cuo! the latter, boundary condition on 
(a, 6) means that 

(0 < a < 1, 6 = 0) or (0 ^ o = 6 ^ i) or (0 < 6 = 1 — a < t). 

Now indeed, ^ = (“ “ 

6)2[6 - 5(a + 6)] ^ 0, Do.q=i\f^^g = a‘^p[a + 6p(l - a)] ^ 0, and Do.g^i\^^^ = 
6(1 — a)a2(l — p)2 ^ 0. It remains to show that 

-Do;g=l,b=l-a ■=^0;(J=1 

=1 + a — 4a2 + 2a^ — a(5 — 2a — 4a2)p + 6(1 — a)a^p^ ^ 0; 


this expression is denoted by derOqlbla in in the files derOql.nb and derOql.pdf. 
If iDo;g=i,b=i-a has a local extremum at some point (a,p) G (0,1)2, then at this 
point 


^0-,q=l,b=l-a ■—daDQ-q—i^b=l-a 

=1 — 8a + 6 o 2 + (—5 + 4a + 12a2)p + 6(2 — 3a)ap2 = 0, 

^ 0 ; 9 =i.b=i-a '=l dpDo.g^i^b=i-a = -5 + 2a + 4a2 + 12(1 - a)ap = 0 

(these two partial derivatives are denoted by DlOderOqlbla, DOlderOqlbla, 
respectively, in the files derOql.nb and derOql.pdf). Hence, 

-^a(-Po; 9 =l,b^l-a> -Po;g^l.b=l-a)(Q^^) ^ 247 ^ _ 229p‘^ + 57p^ = 0 , 

12(1 -p) 

^pi^fq=l.,b=l-a-> -^ 0 ;g=l,b=l-a)(®’ 


6a2 


= (-19 + 130a - 208a2 + Oba^) 

X ( — 1 2ci) = 0, 


whencepG [^,^] and a G {^j U [^, ^] U [^, ^], with the correspond¬ 
ing values of iDo;q=i,b=i-a in 

r 8797 19381 ] , , [ 1050018257 11209569 ] , , [ 23386805609 1055742276137 ] ^ [n 
1-20000’ 40000-1 ^ 1-1250000000’ 12500000J ^ 1-312500000000’ 5000000000000-1 ^ 
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here we used the standard method of interval calculus - see e.g. [i7l Sec¬ 
tion 14.9.4]. 

It remains to show that I?o;(j=i,h=i-a ^ 0 when a or p is in the set {0,1}. But 
indeed Zlo;g—l,fc —1 —a I — 1 ^ 0; l,fe—l — 1 .^—I”*! |p—o — 

(1 — a)[l -I- 2a(l — a)] > 0, and Ilo;ij=i,6=i-a|p_;^ = (1 ~ 2a)^ ^ 0. Thus, the 
proof of Lemma |5.5| is complete. □ 

Lemma 5.6. > 0 for all (a, b) S wi and q G (0,1). 

Proof of Lemnia \57S[ Assume indeed in this proof that (a, b) G wi and q G (0,1), 
unless otherwise indicated. Also just in this proof, let 


In view of (4.1), 


Di 


b^q' 


D. 


Dl -, P =0 _ ^2 


= a\a - by 


+ q{—a‘^ + 2a^b — 3a^b + 3a^ + 2ab^ — 6ab^ + Gab — 2a — b'^ + 36^ — 36^ + b), 


which is of degree 1 in q. So, without loss of generality q G {0,1}. Next, 
-Di;p=o. 9 =o = a^(a — by > 0. Further, 

Di.p^o^q=i = (1 - bfb - 2o(l - by + 0^(3 - 36 + 6^) 


is convex in a. 


^i;p=o,,=iL^i_, =(1 - by[l + 26(1 - 6)] 0, 

{daDi;p^o,,=i)l^,_, =2(2 - 6)(2 - 6) ^ 0, 

5 i?l;p=0,g=l = 5 (^aL^l;p=0,g=l) = — 1 “h 66 — 66^ -|- 26^ ^ 0 

if 6 ^ i (which latter follows from the conditions a = b and (a, 6) G wi). Also, 
(a, 6) G Wi implies that a ^ (1 — 6) V 6. Thus, T>i;p=o,g=i ^ 0, which completes 
the proof of Lemma |5.6[ □ 

Lemma 5.7. Ili;p=i ^ 0 for all (a, 6) G wi and q G [0,1]. 

Proof of Lemma 1 5. 7| Assume indeed in this proof that (a, 6) G wi and q G (0,1). 
Recall (4.3) and ( |4.4| ); in Mathematica notebook derlpl, we keep the notations 
D002derlpl and DOOlderlplql for and respectively. Note 

that = —6[1 -I- 2(a — 6)](1 — 6) < 0, so that is concave in 6. 

Next, = 6(1 - > 0 and 1 [■D?°p=iL=J L=«+i /2 = ! + (§- 

3m -I- 2u^) > I > 0, because the condition (a, 6) G wi implies | < a < 1 and 
hence u := a — f > 0. 


So, Lli{p=i > 0, whence Zli;p=i is convex in q. Recall ( |4.6| ) and ( |4.5| ). It 
remains to show that < 0. 
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Consider 


:= \ = 2 - 20a3 + 6a^(6 -b)-b^- 6a(3 - 6^); 


this expression is denoted by D201derlplql in Mathematica notebook derlpl. 
Note that = 12a — 6& > 0, so that is convex in b. Next, 

= -(1 - a)2(7a - 1) s$ 0 for a > i and -D?°pLi,,=i = 

2 — 18a + 36a^ — 21a^ attains its maximum value —0.102 ... < 0 over all a > ^ 


at a= i(4 + y2) = 0.77.... 

So, 71i°pLi,q=i < 0, which shows that is concave in a. Further, 

71?!],=i.9=iL^b = = -6(i - b)^^ < 0. It follows that 

indeed < 0, which is what remained there to complete the proof of 

Lemma 15.71 □ 


Lemma 5.8. Zli;q=o ^ 0 for all (a, 5) G Wi and p G (0,1). 

Proof of Lemma \57^ Assume indeed in this proof that (a, b) G wi and p G (0,1), 
unless otherwise indicated. Also just in this proof, let 


In view of (4.1), 


Di ■■= 


a^p 


Di-q^o = (a —6)^6^ + [a(l —a)^ + 6(2a^ —6a^ + 2a6^ —3a6+6a —6^ + 36—2)]p, 

which is of degree 1 in p. So, without loss of generality p G {0,1}. Next, 
-Di;q=o.p=o = (® ~ > 0. Further, 

i)i:g=o,p=i = (1 - a)^(a - 26) + (3 - 3a + a^)6^ 


is convex in 6, 

Lii:q=o,p=i=(1 — o)^[l + 2a(l — a)] ^ 0, 
(9o7ii;q=o,p=i) =2(2 — a)(l — a) > 0, 


and so, Zli;q=o,p=i ^ 0, because the condition (a, 6) G uji implies 6 > 1 — a. 
Therefore, 71i;g=o ^ 0, which is equivalent to Di-g^Q ^ 0 and thus completes 
the proof of Lemma 5.8 □ 


Lemma 5.9. Di-q^i > 0 for all {a,b) G wi and p G (0,1). 

Proof of Lemma [57P[ Assume indeed in this proof that (a, 6) G Wi and p G (0,1), 


unless otherwise indicated. In view of (4.1), 


Di-q^i = b^{a^b^ — 3a^b + 3a^ + 2ab^ — 6ab^ + Qab — 2a — b‘^P 36^ — 36^ + 6) 
— a?h'^p{a^ + 2a?b — 6a^ — ah^ + 6a + 26^ — 66^ + 66—2) 

— a^p^{a^ — 2a^b — 3a^ + 6a^6 + 3a^ — 2ab^ + 3a6^ — 6a6 — a + 6^ — 36^ + 26). 
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If Di-q^i has a local extremum at some point (a, b,p) € Wi x (0,1), then at this 
point 

daDi-q^i =2b^{ab‘^ — Sab + 3a + b^ — 36^ + 36—1) 

— ab^p{5a^ + 8a^b — 24a^ — Sab^ + 18a + 46^ — 126^ + 126 — 4) 

— a^p^{7a^ — 12a^6 — 18a^ + 30a^6 + 15a^ — 8a6^ + 12a6^ 

— 24a6 — 4a + 36^ — 96^ + 66) = 0, 

dhDi-q^i =6^(5a^6^ — 12a^6 + 9a^ + 12a6^ — 30a6^ + 24a6 — 6a — 76"^ 

+ 186^ - 156^ + 46) 

+ 2a^p^{a^ — 3a^ + Sab^ — Sab + 3a — 26^ + 36—1) 

— 2a^bp{a^ + Sa^b — 6a^ — 2ab^ + 6a + 56^ — 126^ + 96 — 2) = 0, 

=^ 2(2 _ 6a + 6a^ - - 6b - 2aH + 66^ + ab^ - 26^) 

— 2ap(a^ — 2a^b — Sa^ + 6a^6 + 3a^ — 2ab^ + Sab^ — 6ab — a 

+ 6^ - 36^ + 26) = 0; 

these three displayed expressions are denoted respectively by DlOOderlql, DOlOderlql, 



Iosif Pinelis/ 


31 


DOOlderlql in the Mathematica notebook derlql.nb. So (cf. (5.3)), 


101 1; q2 1) 

a%^a-b) 

=3a^6 + a®(116^ - 60& + 24) - a^(152 - 2Q7b - 66^ + 356^) 

- 0^(556"* - 3486^ + 369&2 + 4866 - 408) 

- 0^(976^ - 6606'‘ + 17646^ - 16726^ - 2766 + 600) 

- a‘‘(336® - 4926® + 21966'‘ - 43046® + 35166^ - 3606 - 520) 

+ a®(56^ + 1266® - 11436® + 37246® - 57726® + 40686^ - 7326 - 264) 

- 3a^(6 - 1)^(56® - 106® + 806® - 3166® + 4686^ - 2246 - 24) 

+ 4a(6 - 1)®(6 + 2)(56® - 146® + 306^ - 266 - 1) 
-12(6-l)^6(6+l)(6 + 2) =0, 

oil b{,p{di}D\-q—\^ 1 )(ft, 6) 

2a262(ft - 6) 

= 2ft®°6 + a®(196^ - 546 + 18) - a®(76® + 1236^ - 3126 + 120) 

- ft^(886® - 2826® - 1356^ + 7946 - 342) 

+ ft®(526® + 3666® - 13386® + 5856^ + 10146 - 540) 

- a®(856® - 306® + 8946® - 29476® + 20526^ + 5406 - 510) 

+ 3a®(756^ - 2096® + 3186® + 1396® - 11086® + 9426^ - 546 - 96) 

- ft®(6 - 1)(526^ + 5866® - 18176® + 21226® - 986® - 14846^ 

+ 4746 + 90) 

+ 6a^(6 - 1)2(176^ - 346® + 1646® - 3006® + 2026® + 266^ - 376 - 2) 

- 6a(6 - 1)®6(136® - 176® + 726^ - 266 - 6) 

+ 2(6 - 1)®62(6 + 2)2(76 - 4) = 0, 
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res :=i?h(res^°\res°^^)(a) 

= 221184(o - + 6o - l){13a^ - 2la^ + 12a - 2) 

X (242a^ - 2130a® + 8562a® - 20049a‘‘ + 29184a® - 25956a2 

+ 12844a - 2700)"^ 

X (10232022093578922a®2 + 243983422645932620a®® 

- 4509200751103371792a®° + 64738047761822250900^® 

+ 392173262502922141262a®® - 4886702832420734534706a®®’ 

+ 33179277660685865127615a®® - 156920889820420563967402a®® 

+ 563853803520130467807750o®^ - 1609531040121454475809908a®® 

+ 3749969793954874952607178a®® - 7259852838023387480456886a®® 

+ 11824823903860313224996785a®® - 16347303258436435525821596o®® 
+ 19300470236420386651602360a®® - 19542251444785239731292096o®®’ 
+ 17012372613520345095894435a®® - 12747057132858406422499290a®® 
+ 8218546949212261087298286a®'® - 4552228300462560181850156a®® 

+ 2159681892325551379446354a®® - 873593583957851406529086a®® 

+ 299376707230436244983721a®® - 86180301652329538603686a® 

+ 20605976715300445636407a® - 4032357710602439708982a®’ 

+ 633335473568252968362a® - 77778352414976716710a® 

+ 7204715390026622547a® - 478176202267116138a® 

+ 21028578810363360a® - 535896014633136a + 5818539184128) = 0; 

the three resultants above are denoted respectively by reslOl, resOll, res in 
the Mathematica notebook derlql.nb. The latter equation, res = 0, has no roots 
a € (|, 1) - which latter condition is implied by the condition (a, 6) G oji. So, 
does not have a local extremum at any point (a, b,p) G wi x (0,1). 

So, it remains to show that -Di;g=i > 0 for all (a, b,p) G Wi x [0,1] such that 
p G {0,1} or (a, 5) is on the boundary of wi; the latter, boundary condition on 
(a, b) means that 

< a = 6 < 1) or (0 < 6 < 1, a = 1) or < a = 1 — 5 < 1). 

Now indeed, 

= 6®[(1 - 6)®(6 - 2a) + a®(3 - 36 + 6®)] 

is convex in a. At that, = ^®(1 — 56® + 66® — 26"®) > 0 and 

daDi-q=i^p^o\^_^_^ = 26®(2 — 6)(1 — 6) > 0 for 6 G [0, ^]. Also, for all (a, 6) G wi 
one has a > 1 — 6 and (a = 1 — 6 6 G [0, |]), whence i4i;q=i_p=o ^ 0. 

Next, 

-Di; 5 =i,p=i = ^i;g=i|p^i = ~ 5)®[a®(l — a)® + (1 — 6)®6®] ^ 0 
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for all (a, b) e Wi. 

Further, for all (a, 6 ) G wi one has a = b b G [j)!]) whence for all 

P e [ 0 , 1 ] 

i5i;,=i.a=6 = ^1;9=1 - 66" + 66 - 1) ^ 0. 

It remains to consider the two more difficult boundary cases, with a = 1 and 
with 0 = 1 — 6 . 

Consider now 

^l;9=l.a=l '^l;g=l|a=i 

= 56^ - 6 ® + p - 26^(4 + p) + 6^(4 + 7p- p^) + 6(1 - 8 p + 2p^). 

(5.6) 

If Ili;g=i.a=i has a local extremum at some point ( 6 ,p) G (0,1)^, then at this 
point 

^p,=i.a=i = (6 - 1)(562 - 106 - 1) + 2(36 - 4)p + 2p^ = 0, 

^?; 9 =i.a=i :=9p^i;9=i.a=i = 1 - 86 + 76^ - 26^ + 2(2 - b)bp = 0 ; 

these two displayed expressions are denoted respectively by DOlOderlqlal, 
DOOlderlqlal in the Mathematica notebook derlql.nb. Hence, 

^p(£>l;g^l a^l,^l,g^l,a^l)( 6 ) ^ ^ ^ _ 53^4 ^ 

2(1 — b)^ 

— 1 = 392 - 1304p + 293^^ + 626^^ 

4(1 — p)^ 

+ 28p‘^ - 40p® = 0; 


these two resultants are denoted respectively by respderlqlal, resbderlqlal 
in the Mathematica notebook derlql.nb. For the roots of these two resultants. 


one has 6 G 3 ^] U [^, ^] and p G with the corresponding 

values of 7)i in the set [ 1899988901 672192933 1 y r 444839 7710771351 1 ^ 

vctiLAv-a vji- i-q—i^a—L LIQOOOOOOOOO ’ 2500000000-1 ^ It^nnnnnn^ i nnnnnnnnnn I 


L5000000’ 10000000000J 


[0, oo). Therefore, Di.q^i^a=i ^ 0 at the interior critical points (6,p) G (0,1)^ 
of 1 1 . At that, l,a —1 I b—Q — P ^ 6, l l | — (1 p) ^ 0, 

i?i;q=i,a=i| Q = 6 + 62(2 - 6)2(1 - 6) ^ 0, and iii;q=i,a=i| ^ = (1 - 6)^ ^ 0 


(5.7) 


for (6,p) G [0,1]2. Thus, Ili;q=i_a=i ^ 0 and, in view of ( |5.6| ), 
-Dl;q=l,a=l = -^l;g=l|a=i 6 for (6,p) G [0,1]^. 

Finally, to complete the proof of Lemma |5.9[ consider 


Dv. 


l;9=l.a=l-h (i_b)2b2 -Di;q=i | 

=p + 6(1 — lip + 6p^) + 62(2 + 14p — 12p2) — 6^(2 + 4p — 6p^), 

(5.8) 
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for {b,p) e [0, i] X [0, 1]; the latter displayed expression is denoted by derlqlalb 
in the Mathematica notebook derlql.nb. If Di.q^i^a=i-b has a local extremum 
at some point {b,p) S (0, \) x (0,1), then at this point 

Dl°q=l,a=l-b ■=dbDi-q=i,a=i-b = I - Up + + 6(4 + 28p - 24p^) 

- 662(1+ 2p-3p2) = 0, 

D'i]q=i,a=i-b = 1 - 26(5 - 6p) + 6^(4 - 12p) = 0; 


these two displayed expressions are denoted respectively by DOlOderlqlalb, 
DOOlderlqlalb in the Mathematica notebook derlql.nb. Hence, 


l,g_l a-l ^ l,g-l a-l b)\ ^ j 36 - 806^ + 966^ = 0 , 

6(1 — 6)^(1 — 26) 

Rb{DY\q=l^a=l-b^ R'i]q=l,a=l-b)ip) 


12(1-p) 


= -43 + 247p - 229p2 + 57p'' = 0, 


whence 6 G P ^ the corresponding values of 

I>i: 5 =i,a=i-b in the set [ J 2 V 0 OT 000 ’ 1 ^ 2 si) 0 ° oto 0 ] the latter two resultants 

are denoted respectively by respderlqlalb, resbderlqlalb in the Mathemat¬ 
ica notebook derlql.nb. Therefore, 71i;5=i,a=i-b ^ 0 at the interior critical 
points (6,p) e (0 ,\) X (0,1) of Di-q=i,a=i-b- At that, i)i;,=i,a=i-b|f,^o = P ^ 0, 
i)i: 5 =i,a=i-b|j^i = I(1 - p)^ > 0, I)i;,=i,a=i-b|p^(j = 6 -k 26^(1 - 6) ^ 0, and 
Di;q=i,a=i-b\p^^ = (1-26)2 > 0 for (6,p) e [0, x [0,1]. Thus, iii;q=i,a=i-b ^ 

0 and, in view of ( |^ , Ili;g=i.a=i-b = A>i; 9 =i > 0 for (6,p) e [0, ^Jxp, 1]. 

This completes the proof of Lemma |5.9[ □ 


Lemma 5.10. D 2 ;p^o ^ 0 for all (a, 6) G W 2 and q G (0,1). 


Proof of Lemma 5.10 Assume indeed in this proof that (a, 6) G 0^2 and q G 
(0,1), unless otherwise indicated. Also just in this proof, let 


In view of (4.8), 


D2 := 


bq- 


D 


D 


2;p=0 


2 ;p=0 

bq 


= —2a -k 3a2 -k 6 — 3a^b + afb^ -k (2a — 6)6(3 — 36 -k b^)q, 


which is of degree 1 in q. So, without loss of generality q G {0,1}. 

Next, let us show that D 2 -p=o,q=o = —‘2-a -k 6 -k a2(3 — 36 -k b"^) ^ 0. Since 
dbD 2 -,p=o,q=o = 1 -k a2(26 — 3) < 1 -k (26 — 3) < 0, without loss of generality 
6=1. But D 2 ;p=o,q=o\i,^i = (1 ~ n)2 ^ 0, which conhrms that D 2 ;p=o,q=o ^ 0. 
Finally, to complete the proof of Lemma 


5.10 


let us show that 7l2;p=o.g=i = 
(1 — 6)^(6 — 2a) + a2(3 — 36 -k 62) > 0. Note that D 2 -,p=o,q=i is convex in a, and 
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at that i) 2 ;p=o, 9 =iL^i = 1 + (2 - 6)^(1 - 6)6 > 0 and {daD 2 -p=o.q=i)\^^-^ = 
2[2(1 — 6^) + 6^] ^0 for 6 S [ 0 , 1 ]. Since the condition (a, 6) € uj 2 implies 
a > 1, we conclude that indeed £* 2 ;p=o, 9 =i ^ 0, which completes the proof of 
Lemma 15.101 □ 


Lemma 5.11. Il 2 ;p=i ^ 0 for all (a, 6) G 002 and q G (0,1). 


Proof of Lemma 5.11 Assume indeed in this proof that (a, 6) G UJ 2 and q G 
(0,1). In view of (|4.8|, £> 2 ;p=i = / 1 / 2 , where fi := fi{q) := + 6(6- 2a)q 

and /2 := f 2 {q) := 1 — 6(3 — 36 + b^)q. Let us show that each of the two 
factors fi and /2 is nonnegative. Since each of them is of degree 1 in q, it it is 
enough to note that /i(0) = ^ 0, /i(l) = (a — 6)^ ^ 0 , / 2 ( 0 ) = 1^0, and 

/2(1) = (1 — 6)^^0. This completes the proof of Lemma 5.11 


□ 


Lemma 5.12. D 2 -q=o ^ 0 for all (a, 6) G UJ 2 and p G (0,1). 


Proof of Lemma \5.11^ Assume indeed in this proof that (a, 6) G UI 2 and p G 
(0,1). Also just in this proof, let 


In view of (4.8), 


D2 := 




D 2 n=o ■= ° = I — 3a + 3a^ — 2a^6 + a 6 ^ + a(3 — 3a + 2ab — b'^)p, 

’ a^p 

which is of degree I in p. So, without loss of generality p G {0,1}. 

Next, let us show that D 2 ;q=o,p=o = 1 + a^(3 — 26) — a(3 — 6 ^) ^ 0. Since 
dbD 2 -,q=o,p=o = 2.a{b—a) < 0, without loss of generality 6=1. But i? 2 ; 9 =o.p=o|f,_i 
(1 — a)^ 0 , which confirms that D 2 -,q=o,p=o ^ 0 - 

Finally, note that D 2 -q=o,p=i = 1^0. 

This completes the proof of Lemma |5.12[ □ 

Lemma 5.13. D 2 ;q=i > 0 for all (a, 6) G UJ 2 and p G (0,1). 

Proof of Lemma \5.13\ Assume indeed in this proof that (a, 6 ) S W 2 = (1, 00 ) x 
(0,1) and p G (0,1), unless otherwise indicated. In view of ( |4.8[ ), 

D2-,q=i = a^(f2b - 3)(p - l)p - 0 ^( 6 ^ - 3)(p - I)p 

+ a2(36 - 36^ + 6 ^ + p - 66 p + Qb'^p - 2b^p) + 2 a (6 - 1)^6 - (6 - ifb^, 


which is a polynomial in a, 6,p of degree 4 in a. 

The correspondence 

[1, 00 ) X [0,1] X [0,1] =: 0 9 (a, 6,p) ^(i, &,p) e © := (0,1] x [0,1] x [0,1] 

(5.9) 
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is bijective. It maps the unbounded set [1, oo) x [0,1] x [0,1] onto the set (0,1] x 
[0,1] X [0,1], whose topological closure is of course the closed unit cube [0,1]^, 
which is conveniently bounded. Moreover, 


D2-g=l ■— 

is a polynomial in A, b,p of degree 4 in A, which is denoted by der 2 qlA in the 
Mathematica notebook der2ql.nb. Clearly, 0 for all (a, 6 ) € UJ 2 and 

p G (0,1) iff D 2 -q=i ^ 0 for all {A, h,p) G [0,1]^. However, the proof of the latter 
statement, that D 2 -q=i for {A,b,p) G [0,1]^, is more convenient to conduct in 
the original variables a, h,p and for the original polynomial D 2 -q=i - except for 
the case when A = 0 (informally corresponding to the “case” when a -G oo), 
that is, except for the case when {A, b,p) G {0} x [0,1]^. 


The latter, exceptional case, is in fact very easy. Indeed, D 2 ;q=i 


{3-2b){l-p)p ^ 0 . 

Turning now away from the exceptional case, 
respectively, the interior and the boundary in K" 


A=o 


let inti? and bdi? denote, 
of any set i? C Then, 
under the bijective correspondence (5.9), int© and bd© precisely correspond 
to int© and (bd©) \ ({0} x [0,1]^), respectively. Thus, it remains to show that 
D 2 -q=i is nonnegative at all points {a,b,p) G bd© and at all critical points 
(a, b,p) G int ©. 

If has a local extremum at some point {a,b,p) G int © = wi x (0,1), 

then at this point 


daD 2 -q=i =a^p^{8ab — 12a — 36^ + 9) 

- ap{8a^b - 12 a^ - 8ab'^ +9a + 46^ - 126^ + 126 - 2 ) 
+ 2b{ab^ — 8ab + 3a + 6 ^ — 36^ + 36 — 1) = 0, 
dbD 2 ;q=i =2a^p^{a — b) — 2a^p(a^ — a 6 + 36^ — 66 + 3) 

+ (6 - 1)2 (30^ + 8ab -2a- 56^ + 26) = 0, 

dpD2 q-i 2^2 _ 2a?b + 66^ + ab^ — 26^ 

a^ 

+ 2a(3 — 3a + 2ab — b^)p = 0; 


these three displayed expressions are denoted respectively by D100der2ql, D010der2ql 
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D001der2ql in the Mathematica notebook der2ql.nb. So (cf. (5.3)), 


101 (^a^ 2 ;q —1 j ^^2 1 )(^; 5) 

=4a^(3 - 26)3 + lla^(26 - 3f(b^ - 3) - 2a^(2b - 3)(56^ - 306^ - 46 + 51) 
+ a^(356® - 1926® + 4296^ - 552b^ + 4176^ - 246 - 153) 

- 4a(62 - 3) ( 86 ® - 366^ + 6063 _ 45^2 ^^b + 3) 

+ ( 6 ^ - 3)(126® - 486® + 606'‘ - 1263 _ 24^2 ^2b + 1) = 0, 

oil Rp{S^h 02 -^q—\i ^ dp 02 -q—l^(^Cl^ b'j 

= - a®(26 - 3)2 + a^{2b - 3){4b^ - 36 - 6 ) - a'‘(62 - 3)(562 - 66 - 3) 

+ 03(656® - 3366"^ + 66663 _ Q28b'^ + 2736 - 36) 

+ 0^(55 - 3006 + 31262 4qq^3 _ 930^4 ggg^s _ ^^ 24 ^ 6 ) 

+ a(646^ - 2286® + 486® + 6986^ - 9846® + 42062 - 6 - 18) 

- 2(6 - 1)26(56 - 2)(62 - 3)2 = 0 , 


res :=i?6(res^°3,res®33)(a) 

= -64(a - I)3®a2(20a3 - 84a2 + 132a - 71)^ 

X (177370200000a^^ - 1259087910000a^® + 3655379856375a^® 

- 5140724114275a3^ + 2772365717265a^3 _ I047016242513a32 

+ 12034314857519a^3 - 35530102580211a3° + 50669964404745a® 

- 454713983795530® + 30834049368285a^ - 17201730242769a® 

+ 7858116810251a® - 2832129730923a^ + 749567464029a3 

- 130666203737a2 + 12938625819a - 540087699) = 0; 

these three resultants are denoted respectively by reslOlp, resOllp, res in the 
Mathematica notebook der2ql.nb. The only root a G (l,oo) of the resultant 
res is the only root a G (l,oo) of the equation 200 ® — 84a2 + 132a — 71 = 0, 
which is a = a* := I (7 + 3 X 2“®/® — 2®/®). Then the only root 6 G (0,1) of the 
equation res®®® = 0 is 6 = 6 * := 1 — 2“®/®. Next, for any p G (0,1) one has 
02;q=i = * 2 M (28 + 26 X 2®/3 + 17 X 2^/3) > 0. Thus, B 2 ;q=i > 0 at all 

the possible critical points (a, b,p) S int 0 = wi x ( 0 , 1 ). 

So, it remains to show that D 2 -q=i ^ 0 for all (a, b,p) G 0)2 x [0,1] such that 
p G {0,1} or (a, 6 ) is on the boundary of a; 2 ; the latter, boundary condition on 
(a, 6 ) means that 

(a = 1,0 ^ 6 ^ 1 ) or (6 = 0 , a ^ 1 ) or (6 = 1 , a ^ 1 ). 

Now indeed, 

02-q=i,p=o = D 2 -q=i = 6[(1 — 6)3(6 — 2a) + a2(3 — 36 + 62)] 
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is convex in a. At that, D 2 -q=i^p=o\^_^ = h{l + 46 — 86^ + 56^ — 6“^) ^ 0 and 
daD 2 -,q=i,p=[j\^^^ = 2[2(1 — 6^) + 6^] ^ 0 for 6 G [0,1]. So, £* 2 ;(j=i,p=o ^ 0- 
Next, £> 2 ; 9 =i,p=i = D 2 -q=i\^^^ = (« “ “ b)^ > 0. 

Further, 

D2;q=l,a=l = = F^l;q=l,a=l ^ 0; 


by (5.6) and (5.7). 

Finally, D 2 -q=i\j^^Q = a^p[l + 3a(a - 1)(1 - p)] ^ 0 and F> 2 ;g=i 
0^(1 — p)[l — p + p(a — 1)^] ^ 0. 

This completes the proof of Lemma |5.13| 


b=l 


□ 


Lemma 5.14. D^.p^o ^ 0 for all (a, 6 ) G wa and q G (0,1). 

Proof of Lemma 5.14 In view of ( |5.5[ ), D 3 ;p=o = 6^g[(?a^ + (l —q)(a—6)^], which 
is manifestly nonnegative for all (a, b) G uis and g G [0,1]. ^ 


□ 


Lemma 5.15. D^.p^i ^ 0 for all {a, b) G W 3 and q G (0,1). 

Proof of Lemma \5.15\ In view of ( |5.5| ), D^.p^i = 6^(1 —g)[(l —g)a^ + g(a —6)^], 
which is manifestly nonnegative for all (a, 6) G UJ 3 and g G [0,1]. □ 


Lemma 5.16. D^.q^o ^ 0 for all (a, 6) G UJ3 and p G (0,1). 

Proof of Lemma [5. 1 6} In view of ( |5.5| ), = a^p[p6^ + (l—p)(a—6)^], which 

is manifestly nonnegative for all (a, 6) G UI 3 and g G [0,1]. □ 


Lemma 5.17. D^.q^i ^ 0 for all (a, 6) G wa and p G (0,1). 


Proof of Lemma \5.11\ In view of (5.5), IIa;<j=i = a^(l—p)[(I—p)6^+p(a —6)^], 
which is manifestly nonnegative for all (a, 6) G 0J3 and p G [0,1]. □ 

Now Lemma [3?^ follows immediately from Lemmas |5.IH5.17] 
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